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1e Role of Current Distribution in Cathodic Protection 
Homer D. Holler 


The paper outlines a procedure for determining current distribution over an electrode 
surface, as required in cathodic protection or in electroplating when the electrode potential 
bears a known relation to current density; and shows the relation of current distribution to 


resistance of current path and counter electromotive force. 
tion (without polarization) is also computed 


The primary current distribu- 
A method is suggested for determining the 


electrode potential-current density relation over an extensive surface in a uniform medium 
In a nonuniform medium, the determination of current density by measurement of electrode 


potential becomes complex. 
must be relied upon. 


I. Introduction 


The electrical requirement for complete cathodic 
pr ction of a metal from corrosion was demon- 
strated vears ago by Mears and Brown [1].'. Their 
work established a criterion based on the equaliza- 
tion of surface potentials, which is accomplished by 
polarizing the cathodic areas of the metal until their 
ntials become equal to the “open-circuit” 
ential of the anodic areas. As a result, the 
rent leaving the anodic areas, and consequently 
equivalent rate of corrosion, is reduced to zero. 
The mechanism of the process is based on increasing 
the polarization of the cathodic areas by the appli- 
ation of external current to those areas. 

Let us consider the potential * relations, when a 
voltage, 2, is applied to a galvanic couple in which 
the potential of the cathodic area is e¢, and that 
lof the anodic area is ¢,. If & is gradually increased 
from zero, current J will flow first to the cathodic 
urea When E>Eg, where Eg is the potential of the 
ouple and lies between Ce and e,.. That is, 


the 


Eg 


€a— ola 


€e Tt tole, 


the current. circulating 
before F is applied 

resistance of the anodic path 

resistance of the cathodic path. 


within the couple 


Only when E>e, does current begin flowing to 
initially anodic area, and this occurs when 
lars, Where J, is the total cathodic current; 
land ¢, becomes equal to &, the open-circuit potential. 
As the potential relations within a galvanic couple 
are such that applied current flows to the cathodic 
areas as required, it may at first appear that there is 
no problem of current distribution in the application 
of cathodic protection. 
res in brackets indicate the literature references at the end of this paper 


term “potential” used herein really means a potential difference; and if 
t is flowing, a polarized potential is understood 


In such a case 


That is, the problem seems | 


the potential eriterion of cathodic protection 


to be automatically solved. However, let us consider 
the typical condition when galvanic corrosion is 
most insidious, that is, localized pitting. Then, the 
anodic area is negligible, and the problem becomes 
a practical one of obtaining current distribution 
over the cathodic area, which is approximately the 
whole area. In any study of the factors that deter- 
mine current distribution over an extensive cathode 
area, the geometry and dimensions of the metal 
structure and its surrounding medium are determin- 
ing factors. It is the purpose of this paper to 
analyze the potential-current relations that control 
this current distribution, related to cathodic 
protection. 


as 


II. Dimensions and Current Distribution 


In cathodic protection, we deal with cells of all 
shapes and sizes. It has been theoretically demon- 
strated [2] that the over-all potential-current rela- 
tions in a large cell may not be truly represented by 
the results obtained in a model of a very much 
smaller size. This is true because the potential 
differences within a cell, through which current is 
flowing, consist of two kinds of components. One 
comprises electromotive forces which, for given 
current densities, are independent of the size of the 
cell; and the other includes those potential differences 
resulting from currents flowing through resistances 
which, for a given resistivity, are determined by 
dimensions. The smaller cell may therefore not be 
a true electrical model of the larger one for a given 
current density unless the resistivity of its electrolyte 
is so adjusted that the resistive components in the 
two cells are equal. 

If the required adjustment of resistance is im- 
practicable, the potential-current relations found in 
a small model may not be applicable directly toa 
large cell, A procedure for determining these rela- 
tions by direct measurement in the cell therefore 
seems desirable. This is particularly true in’ the 
case of electrical circuits such as those involved in 
electrolysis mitigation and cathodic protection of 
underground pipelines where the resistive com- 
ponents may not be realizable cn a laboratory scale. 
| Here we have large cathodic areas, and the volume 





of electrolyte is unlimited. In the application of 
cathodic protection, the position of the anode, 
through which the external current is supplied, is of 
considerable practical importance. It is the purpose 
of this paper to outline a method tried in the labora- 
tory, which might be translated into a field procedure 
for determining current distribution. Field experi- 


ence will then determine whether the laboratory 

procedure is applicable to underground conditions. 
Theoretical methods of computing current. at- 

tenuation along a conductor of great length fre- 


quently neglect the role of counter emf and polariza- 
tion [3], and consider the resistance as the only con- 
trolling factor. On the latter assumption, the current 
density at different points on the cathode varies in- 
versely as the resistance of the current paths from 
the anode to the respective points, in accordance 
with Ohm's law. However, any counter emf reduces 
the current in the same ratio as that of the counter 
emf to the applied emf at that point. Any increase 
in counter emf therefore tends to reduce the cur- 
rent at points of higher current density to a greater 
degree than at points of lower current density. 
This results in a more uniform current distribution, 
for a given applied voltage, than if resistance were 
the sole current-limiting factor. For example, if 
7, and 7, are the currents flowing to unit areas having 
electrode potentials ¢, and ¢, respectively, then [4] 
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where r, and r, are the resistances of the paths of the 
currents. For a given metal and environment, 


Of, OF; Or 
Oi, OF, O 


If d¢/0i becomes very large as compared with the 
resistances, r and ry, Of, O/, approaches 1. In electro- 
plating, this phenomenon is called “throwing power’, 
which is a function of 0e/0/, and the geometry of the 
cell and may be expressed inseveral ways. In cathodic 
protection also, a high value for 0¢/0/, as compared 
with resistance, favors throwing power and therefore 
greater uniformity in current distribution. 

In cells of very small dimensions, as in pits and 
crevices on the metallic surface, the effective re- 
sistances r and r,; may be small; and the value of 
0¢/0/ probably greater because of larger ionic con- 
centration gradients. These conditions favor a 
higher throwing power than over large areas free 
from sharp surface irregularities. 


III. Determination of Current Distribution 


In any study of current distribution, it is essential 
that a method of determining apparent current den- 
sity at any point be available. When current dis- 
tribution is uniform, 


, 


A 


where / is the total current; and A is the total area 
If 7 and A can be measured, there is no problem in 
determining the apparent current density. How- 
ever, When the current distribution is not uniform 
the relation of current density to polarization, jf 
there is such a relation, may be used to determine | 
at any point where the cathode potential e, can be 
measured. If the potential e. can be measured 
without including any resistive components, then the 
relation, ¢.=(f)7, may be used to determine 7 at any 
point in cells, regardless of dimensions. 
In order to demonstrate the relation of current 
distribution to polarization and resistance, a test cell, 
large enough to obtain a convenient current-density 
gradient, was used. It consisted of a wooden tank, 
3 ft by 10 ft by 1 ft deep, entirely insulated from 
outside circuits (fig. 1). A *4-in. steel tube extended 
the full length of the tank and a small steel anode, 4, 
was in one corner, as indicated. The electrolyte 
covered the tube by several inches. The problem 
was to determine the current density at points 1, 2, 
.. 10 inclusive, for a given applied emf, £,. 
Direct measurement of current density at any point 
was impracticable. However, if the relation of 
cathode potential ¢, to 7 is known, then 7 can be 
easily determined. In the present case, the current- 
density gradient along only one dimension is of 
interest. For this reason current density is ex- 
pressed as current per unit length of tube. Varia- 
tion in current density around the tube is disregarded 
In order to obtain the data for a graph showing 
the relation between e,. and 7, measurement of ¢, on 
a given length of tube, on which the apparent cur- 
rent density is assumed to be uniform, is necessary 
at different values of 7. The obvious method of 
obtaining such uniformity is by the use of a parallel 
anode. This was accomplished by replacing the 
small steel anode, A, with a rod equal to the length 
of the tank. Then ¢=//,, where / is the length of 
tube, uniform current distribution being assumed 
For the determination of ¢,, a saturated calome! 
electrode was placed on the surface of the tube at 





each numerical location, and the potential differ- 


ence (e.—,), was measured, using the circuit in 
figure 1, where E&, is the potential of the calomel! 


electrode. 


IV. Method of Measurement 


This cireuit [5] permitted measurement of th 
quantities («.—,) and (e,—F,+-Ir,), where r, is the 
resistance between the reference electrode and _ the 
cathode surface. For a given current, 7, the bridge 
was balanced by adjustment of Y until momentary 
closing of key AK), caused no change in the deflection 
of null-indicator G. Since the resistances in arms 
DD were equal (each 50,000 ohms), then at balance: 
r,=NX. After balance, the counter emf V, was 


ireuit 


rol VE 


Pr 


For 
ire dl a 


dd until there was no deflection of G. Then 


ut (1) 
(e.—E,)—V, 


ei T»)r TD, 


circuit: (2) 


V>.=(—I,)X—1,D. 


acting the equation for (2) from that for (1), 


nee Ps 4 


V_,= voltmeter reading. 
vey Ky be opened and V, be readjusted to a 
value Vy until G again reads zero, then 
Ir..=Ve, where VY, is the voltmeter 
ng. This is equivalent to a potentiometer 
wasurement. When /r, is negligible, use of the 
widge is unnecessary, and (e,—,) may be observed 
lirectly as V.», for all practical purposes. 
lhe instrumental requirements of this circuit are 
stringent. As null indicator G, a General Elee- 
rie galvanometer was satisfactory. In the field, a 
Veston Model 622 voltmeter with zero-center scale 
snd a resistance of 200,000 ohms/v may be suitable 
snd more convenient. A calibrated slide-wire rheo- 
tat of sufficient current-carrying capacity was suit- 
ible for balancing the bridge and reading. This 
reuit has been used in preliminary field tests but is 
ot vet in suitable portable form. 


" 


. Potential Criterion of Cathodic Protection 


For each value of J, the emf (¢.— /.), was meas- 
ired at each point in numerical order, and also in the 
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Circuit for measuring polarization at points of 
different current densities along a steel tube. 


reverse order, making a total of 20 readings. The 
results are summarized in figure 2, where two graphs 
represent the averages, one for tap water and one 
for O.1-percent NaCl solution The “breaks’’ occur 
at potential levels near but slightly below the open- 
circuit potential of iron in a saturated solution of 
ferrous hydroxide. At this potential, 0.813) volt, 
corresponding to a pll of about 9.5. the reaction 
Feet * + 2 reaches equilibrium, and corrosion of 
iron by this process practically ceases [6]. The data 
in figure 2 were obtained for electrolytes that were 
undisturbed, except by convection, and were very 
probably saturated with air. An air-free environ- 
ment is therefore not a requirement for obtaining 
the air-free potential by cathodic polarization. The 
reproducibility of the readings was better than was 
anticipated for iron in a solution initially having an 
undefined concentration of ferrous ions. 

The protective current, /,, indieated in the salt 
solution was about 2.4 ma/ft and in the tap water, 
1.5 ma/ft. Thus, the corrosiveness in the more 
conductive salt solution was greater than in the tap 
water, though not in the ratio of the conductivities, 
which was approximately ten to one. [t is apparent 
that this large difference in conductivity had no 
apparent effect on the potential at which the breaks 
in the polarization curves occurred 

Next, the parallel anode was replaced by the point 
anode A, and for a given value of &, and J, ¢, was 
observed at the numbered points along the tube. 
By the use of the graphs in figure 2, the corresponding 
values of 7 were then estimated for each point. In 
table 1 the values of ¢, and ¢ are given, for each point 
of observation, in a 0.1-percent solution of sodium 
chloride for a current of 12 ma. Since the potential 
of the tube =e, when 7—0, the difference Ae, =e, 
represents the emf of polarization, also given 
table 1. 
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Relation of electrode potential to current density 
along a steel tube 


Ficure 2. 


4, Tap water; B, 0.1-percent NaC! solution 





Taste |. Observed potentials and current densities at different 
points; calculated resistances and current-density components 


¢.=715 mv; ¢4=588 mv; BE, =1,170 mv; J/=12 ma 
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VI. Calculation of Resistance of Current Path 


Let us now analyze the relation of current-density 
attenuation to the resistance of the current paths 
from the anode to successive unit lengths of tube. 
For a given current 7, the polarized anode poten- 
tial=e,. Then, since the applied emf=F,, and the 
external resistance was negligible, 


EK, (@-—€.4) 


i (2) 


r, 


where r=the resistance of the path of current 7 
from anode A to a given unit length of tube. That 
is, 71,%e, ., Ty are the resistances of the paths of 
currents j),/:, . . ., di, respectively. Using eq 2, 
the values of r were computed from the correspond- 
ing values of e¢. and 7, where #,=1,170 mv, and 
€,—588 mv. These results are also summarized in 
table 1. 


VII. Polarization 


Rearranging eq 2, we obtain 


€4) _ 


E, (¢. 
— ’ (>) 


r 


where 7 consists of two components in opposition. 
One, /,/r, with &, constant depends only on r; and 
the other (¢,—e,)/r, with e, constant, depends on r 
and also upone,. The expression /,/r is the primary 
current distribution representing the current density 
that would be obtained if there were no counter emf. 
For example, if the cell were entirely filled uniformly 
with a conductor of the first class, or if it had iden- 
tical nonpolarizable electrodes, like copper in copper 
sulphate solution at low current densities, (¢,—e,)=0. 
Thus F,/r, is the largest current density attainable 
for a given value of &, and r. 

If ¢. >e, and e, is nonpolarizable, having a constant 
value é, then we have a constant counter emf 
(éo—eé,), as for example, in a lead-acid storage 
battery on charge at low current density. 

If ¢. >e, and e, is polarizable, then we have the 
present case, where (¢,—e,) represents the counter 
emf in which e.=é + Ae,. 
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The amount of polariza- | 
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Relation between emf and ir components of polari- 
zation along a steel tube. 
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Ficure 3. 


A, Open-cirenit potential of iron, 0.814 volt 


tion may be expressed as Ae, in millivolts, or as 
Ae./r in milliamperes. Both indices in table 1 show 
that the role of polarization in determining current- 
density distribution diminishes at the lower current 
densities. At the minimum current density, resist 
ance is practically the sole determining factor, as 
indicated in figure 3. 

The current-distribution curves for salt 
(fig. 4) were essentially the same as those obtained 
in the tap water (fig. 5). For example, cathodic 
protection defined as that value of ¢,, equal to or 
above 0.813 v, extended approximately to the 4-ft 
location in each electrolyte; and as shown in (fig. 2), 
a lower current density was required for protection 
in the tap water than in the salt solution. 


VIII. Location of Reference Electrode 


When the tip of the reference electrode was in 
contact with the steel tube surface, there was a 
small resistance r, (fig. 1) between the electrode and 
the surface, which increased with current density 
(fig. 6). On a copper tube, this resistance was 
greater by several-fold and may have been due t 
liberated hydrogen, or to a film resulting from 
increased alkalinity at the surface. 

When the reference electrode was located 18 in. 
from the tube surface, the observed readings were 
averages of the true potentials for a range of points 
extending over a considerable length of the tube. 
For example, in figure 6, the potential (800 my) 
observed at locations 1 and 2, with the electrode 
18 in. away was approximately the average potential 
over a 5-ft length as measured with the reference 
electrode directly on the surface. The significance 
of this observation is that, when the potential «nd 
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ve 4. Cathodic potential e., current density i, and pri- 
, current distribution E,/r along a steel tube in NaCl 
on. 


it NaC] solution; J==!2 ma 


urre nt-densit. gradients along the tube are steep, 
le reference electrode must be as near as possible 
to the surface in order that the observed value 
mpproach the magnitude of ¢, at a given point. At 
best, the observed value of ¢, is an average potential 
ver an area that increases with the distance of the 
reference electrode from the surface. The purpose 
of reducing this distance to a minimum is not that of 
reducing the effect of resistance, since this can be 
vilaneed out by the bridge measurement, but in 
rder to reduce the observed area to a minimum. 
viously, if the cathode potential is uniform, the 
requirement of proximity is unnecessary. .A more 
remote location may then be used, the distance from 
the surface being greater, in proportion to the area 
f uniformity. 

\ frequent subterfuge of locating the reference 
electrode behind the electrode under investigation, 
or at some other point of minimum current density. 

neither eliminate the ir component in the 
observed potential nor indicate the value of e, at 
the front surface of the electrode, which is of most 
importance. The reference electrode should there- 
fore be placed at the exact point, where the value of 
required, with the least possible disturbance of 
lines of current flow. This requires a reference 

‘rode with the smallest exploring tip that is 

ticable. 
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Cathodic potential e, and current density i along a 
steel tube in Washington tap water. 


Ficure 5. 


lap water; / 2ma 


IX. Suggested Field Procedure for Determin- 
ing Current Distribution 


In estimating the current distribution over a very 
extended cathode surface, such as a pipe-line or a 
very large plane surface, the cathodic polarization- 
current density relation may be very useful. For 
determining this relation, a procedure, based on the 
guard-ring principle {7} in electrical measurements, 
suggests itself. For example in figure 7, let a parallel 
three-section line anode be laid as near the pipe sur- 
face as possible without distrubing the surface en- 
vironment. The two end-sections will supply current 
to unknown lengths of pipe over which the current- 
density varies from a maximum to practically zero 
at some unknown distance away. By thus eliminat- 
ing the “‘end-effect” current from the meter reading, 
we know the length of pipe which is receiving the 
measured current /, being supplied by the middle 
section of the anode. 

The degree of uniformity of current density within 
the length /, of course, may not be predictable. — If 
the values of ¢. observed at numerous points within 
length / and the resistivity of the surrounding medium 
along the length / are fairly uniform, then assuming 
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Relation of potential readings and resistance r, 
for different positions of the reference 


Ficure 6, 
along a steel tube, 
electrode. 


A, reference electrode on sur- 
resistance r, with reference 


Steel tube in 0.1-percent NaCl solution; !2 ma; 
face; B, reference electrode ts in. from surface; C, 
electrode on surface 


that ¢, isa function of j the current distribution should 
be uniform. If it is, then a clearly defined potential- 
current density relation similar to that in figure 2 
should be obtainable by the technique outlined in 
figure 7. Such a relation may then be used to deter- 
mine current distribution beyond the ends of length 
/ in a uniform environment. 

In the case of a large plane surface, / in Figure 7 
may represent the diameter of a disk within a ring 
having a slightly larger inside diameter, both being 
in a plane near and parallel to the surface. The 
measured current, /, will flow to an area approxi- 
mately equal to that of the disk, the current density 

i being 4//r?. Then the relation between e, and 
may be established within the disk area by the 
bridge-method used in figure 1. If e, (fyi, the 
current-density gradient outside the disk area may 
be determined by measuring ¢, at points at increasing 
distances away. This is very much simpler than 
the classical but laborious procedure of plotting the 
equi-potential surfaces around the electrode and then 
graphically determining the current density lines 
perpendicular to them. 
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Suggested procedure for measuring current flowing 
to a known length of pipe-line. 


Fieurt 


X. Conclusions 


Using a reference electrode and a bridge method 
which eliminates ir components from the measure. 
ment, the relation between current density and the 
polarized potential of iron was determined in tap 
water and in a 0.1-percent salt solution. By the 
application of this relation to a large cathode receiv- 
ing current from a point anode, the current density 
at different points on the cathode was obtained from 
the electrode potentials measured gj the same points. 
The resistances of the current paths from the anode 
to cathode were computed; and the role of resistance 
and counter emf in current distribution, thereby 
demonstrated. 

By this procedure, current distribution may be 
determined, regardless of the dimensions of the cell, 
if the reference electrode is located very near the 
cathode surface. If the surface potential is uniform, 
the reference electrode may be located at some remote 
point. If the surface potential is not uniform, the 
reference electrode at a remote point will indicate 
an average potential over a given area. 

When the polarized potential is a known function 
of current density, the primary current distribution 
may be computed from the data obtainable by the 
above procedure. By using the guard-ring principle, 
the potential-current density relation, and current 
distribution may be determined over very extensive 
electrode areas in a uniform environment. 
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Multiplets and Terms in Technetium Spectra 
William F. Meggers 


Multiplets and terms of the quartet, sextet, and octet systems are reported for the 


first spectrum of Te. 


The combinations of 20 terms composed of 63 atomic energy levels 


account for 200 Ter lines and 66 percent of the total observed intensity summed for 1,300 


lines. 


The ground state of neutral Te atoms is represented by 44° 5s? Sy,, 
able state by 4¢°5s°Dyy, only 2573 em~! higher. 


and the first metast- 
\ Rydberg series of *S terms vields- 


absolute term values from which a Ritz-corrected limit of 60070 em~'!, or ionization poten- 


tial of 7.45 electron volts, is derived. 


In the second spectrum of Te the principal S and P terms of the quintet and septet 
1 | 


systems are identified and fixed relative to each other. 


Evidence is presented that 47° °D 


represents the normal state of Te* ions, but to confirm this further observations of the Te 1 


spectrum are necessary. 


I. Introduction 


1922 Catalin [1] ' discovered, in the spectrum 
anganese, that certain groups of lines having the 
same character are related by very exact numerical 
separations, and he suggested that the word ‘“ multi- 
be used to denote such groups. Since that 
ime the search for multiplets in all atomic spectra 
has been the chief objective of spectrum analysis. 
This search, aided by the quantum theory, has re- 
sulted in present-day information concerning spec- 
tral terms, atomic structures, ionization potentials, 
and many other properties of atoms, of ions, and of 
atomic nuclei. The purpose of this paper is to 
report on information of this type derived from 
preliminary analyses of the first two spectra of 
technetium, the most recently discovered chemical 
homolog of manganese. 

This chemical element was predicted in 1869, by 
Mendeléeff who found that when the then-known 
chemical elements were arranged in order of increas- 
ing atomic weights, their maximum chemical valen- 
cies exhibited periodicities, with gaps suggesting two 
unknown homologs of manganese. In 1925 Noddack 


and Tacke [2] announced the discovery of both of | 


these elements and proposed for them the names 
“masurium” and “‘rhenium.’” Rhenium soon became 
available in abundance, and in 1931 Meggers pub- 
lished a preliminary description of its are spectrum, 
also reporting multiplets, atomic energy levels, and 
the ionization potential for Ret. The separation of 
masurium from natural minerals has never been 
repeated or confirmed, but the element in question 
has recently been produced artificially either by 
transmutation of molybdenum [3] or by fission of 
uranium [4]. This artificial element was named [5] 
technetium (symbol Tc). A preliminary description 
of the are and spark spectra of Te by Meggers and 
Scribner [6] constitutes the material in which the 
first multiplets and spectral terms of Te were found. 

The basic spectral data consist of measured wave- 
lengths and estimated relative intensities of more 
than 2,200 radiations characteristic of Te atoms and 


ures in brackets indicate the literature references at the end of this paper 


ions. These spectral lines are divided into two 
classes, one (Te 1) characteristic of neutral Te atoms, 
and the other (Tem) of singly ionized atoms. <A 
considerable number of Te lines were observed to be 
complex (c) with hyperfine structure, and many were 
hazy (hk) in the spark spectrograms. Other criteria, 
such as reversibility, pressure displacement, temper- 
ature classification, and Zeeman effect, which are 
extremely helpful in the analyses of complex spectra, 
are still wholly lacking for Te lines. In the absence 
of these aids it might appear presumptuous to at- 
tempt the analyses of Te spectra, but fortunately 
these spectra are so well-observed and orthodox that 
the principal multiplets could be identified without 
difficulty. The first guiding principles in the search 
for Te multiplets were the interval and intensity 
rules, assuming that LS-coupling controls the optical 
electrons. The second clues to Te multiplets came 
from comparisons of analogous Mn and Te spectra 
as regards term intervals, term magnitudes, and 
electron configurations. Under these conditions a 
reliable list of lines, wavelengths, and intensities will 
yield atomic information, provided that the observed 
spectral region and range are right. This statement 
is illustrated by the atomic information derived 
from the analyses of Te 1 and Te u spectra. 


II. First Spectrum of Technetium, Tc : 


1. Tc: Multiplets 


The observed data of the Te 1 spectrum comprise 
about 1,300 lines with wavelengths ranging from 
2324.57 to 8829.80 A, and estimated intensities 
ranging from | to 500. Vacuum wave numbers 
corresponding to the measured wavelengths of Te 1 
lines were obtained from Kavyser’s Tabelle der 
Schwingungszahlen {7}. Like other first: spectra of 
metals, the Te 1 spectrum is weak in the ultraviolet, 
strong in the short-wave visible and adjacent ultra- 
violet, and lacks lines of high intensity in the near 
infrared. My search for multiplets in this spectrum 
began with the identification of a group of three 
lines (€°Sy,—2°P°su, a, ») Which are outstanding 





in the analogous Mn 1 (4030.76, 4033.07, 4034.49 A) 
and Re 1 (3451.88, 3466.47, 3464.72 A) spectra. 

Naively, I interpolated between Mn and Re 1, 
and looked for the analogous Te 1 lines between 3700 
and 3800 A, but failed to find them there. The most 
intense Te 1 lines have wavelengths 4297.06, 4262.26, 
4238.19 A. The wave number separations (189.9 
and 133.2 cm~') of these three lines are almost exactly 

roportional to 7 and 5, which characterize a good 
> term. The same separations were next found 
among seven weak lines (4948 to 5139 A), but the 
significance of these was not recognized until recurring 
differences (302.1 and 449.7 em~') found between 
wave numbers of strong lines (3500 to 4200 A) were 
seen to be identical with the cross differences of the 
seven weak lines. This led at once to the identi- 
fication and fixation of the metastable five-level 
term, a°D, and many multiplets that involve this 
term. Surplus a°D separations among other Te 1 
lines were regarded as belonging to intersystem 
combinations, which gave the clue to the fixation 
and identification of a metastable four-level term 
a‘*D, and multiplets that involve this term. 

Finally, a unique pair of visible Te 1 lines (5924.57 
and 6085.22 A), exhibiting the widest hyperfine 
structure, were assumed to represent the transition 
a’ Sox - sp, 


Taste 1. 


> §Ps, 
232654 


- SP, 


93455.3 


- 6p. 
4 Ih 


235885 


2 * Dj 


27 369.8 


2 ‘Diy 
2») 680.1 
> *Ds 
279408 
: “ry 
IS151.4 
> *D 
28796 7 


> FY 
30067 1 
2 *Fj 7 
3801331 
z * FS, 
B303882.0 
2 °F Sy 
528.8 


J 
z *FRy, 
0889 1 


£ Phy 
831414.0 
r *Ph 
-% 
31406.7 
r *Piy 
31503.8 


a "Sy, 
00 


$207.06( 500) 
23265.2 
$262.26) 400) 
23455.2 
1238. 10/300) 
235884 


27660.1 


2740.8 
3551.2001 
2S8151.5 


30382.0 
30528.8 


30630.5 


3182.38(30) 
31414.0 
3183.12(30) 
31406.7 
3173.30(40) 
31503.9 


a*D, 


2572. 0 


831.3313 


20602.5 


1031.63(1300 
24796.9 
3984.97(150 
25087.2 


3636. 10/300 
27494.2 
3627.30(25) 
27560.2 
3594.93(2) 
27809.1 


3466.29( 200) 
28841.1 


a *Ds 
3250. 4 


1995.00(15) 
20014.5 
148.0608) 
20204.3 


$145.02(80) 
24118.6 
1005. 680150) 
24409.1 
1049. 100150 
24680.9 


3718.88(200) 
26882.2 
3684.78( 100) 
27131.1 
3664.94(3 
27277.9 


3549.74(150) 
28163.1 
3550.66(100) 
28155.8 





This assumption was confirmed by the establ) .))- 
ment of ¢*S and e*D, which give with 2°P° = }y 
strongest lines in the octet system. The prinei sa! 
multiplets of the Tet spectrum are displayed jy 
tables 1, 2, and 3, where the term symbols and val. ¢ 
(relative to a °Sq,=0.0) are shown at the top sad 
left. Differences of these level values are entered jy 
the body of the table where the levels inters: 
These numbers are the actual wave numbers of |! 
lines permitted by the selection rule AJ=0,- | 
The observed wavelengths and intensities (paren 
theses) appear directly above the wave numbers 
The average difference between computed and ob- 
served wave numbers is +0.06 em~', which proves 
that the measured wavelengths of these Te lines 
are rarely in error by more than 0.01 A in relative 
value. The total number of permitted lines in 
tables 1, 2, 3 is 240, whereas the number of observed 
lines is 200. In a few cases the unobserved Te 1 lines 
are obscured by Te mn, by impurity, or by strong 
copper lines, but in most cases the absence of pre- 
dieted combinations is explained by the fact that 
they are either weak intersystem transitions or 
double-electron jumps. One line 4145.0 A is doubly 
classified; the calculated components are separated 
by 1 em~' but were not resolved because of unequal 
intensities. 
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2. Tc1 Spectral Terms 


The Ter multiplets in tables 1, 2, 3 result) from 
combinations of 20 spectral terms embracing 63 
atomic energy levels that are arrayed in table 4 
alongside the analogous terms and levels belonging to 
the Mn i spectrum [8]. The eleetron configurations 
in column 1 apply to Mni; for Ter the principal 
quantum numbers must be increased by one unit. 
Naturally, the intervals, between levels of complex 
terms are larger in Te than in Mn, but the algebraic 
signs, with few exceptions, are the same. A_ plus 
sign means that the term is regular, minus means 
inverted. The Mnt terms are arranged in order of 
increasing magnitude of levels having the largest J 
but the analogus Te 1 terms depart from this order 
because the sixth d-type electron is more firmly 
bound in Te than in Mn. Thus, in Mniu 3d° 4s "Dy, 
is 17052.3 em”! above the ground state 3d° 48° "Sau, 
whereas in Tet 4d° 5s “Dg: is only 2572.9 em™ 
above zero. Similarly, the associated *D», levels 
are, respectively, 23296.7 and 10516.5 em~' above 
zero. It appears that terms from configurations 
containing six d-electrons are displaced about 13,000 
em”~', whereas those from configurations with five 
d-electrons are displaced only about 2,000) em '. 
These displacements are shown in the last column of 
table 4; they served as a guide in assigning Te 1 terms 
to the electron configurations in the first’ column. 
After the relative energies of d®°s and d®' s* had 
been determined for Te as described above, it: was 
called to my attention that an approximate value 
of this difference was predicted in 1928 by Gibbs 


and White [9]. 
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Taste 4. Spectral terms of Mn 1 and Te 1—Continued 


Mnit 
Electron Term 
configuration symbol 
Level Interval Level Interval 
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partures from the Landé interval rule occur in 


-pectra, but they are occasionally much greater | 


1 than in Mnir. For example, the interval 
en the two largest J values of 2 °F° is abnor- 
small in Te 1, and 2°P® is only partially inverted 
I Despite careful search the J=3's level of 
could not be found. The intervals in y“*D® 
xtremely irregular in both spectra. On_ the 
both term intervals and line intensities indi- 
that, to a large degree, the optical electrons of 
toms obey LS-coupling. It is planned to test 
1y observing the Zeeman effect, and incidentally 
eck the proposed designations of Te atomic 
\ levels. 


3. First Ionization Potential of Tc 


two or more spectral terms form a Rydberg 


too-large value for the limit, which is more accurateiy 
given by a Ritz formula of the type y= L—R(n+ wo 
aT)*. In the spectral series for 37Rb, 46Pd, and 
47Ag the Ritz correction @ has an average value of 
2.0% 10°. Assuming that this correction is valid 
also for 43Tc, the absolute value of the aS, term 
of Tet is found to be 60070 em~ and the corres- 
ponding ionization potential is 60070/8067.5 —7.45 
electron volts. 


| III. Second Spectrum of Technetium, Tc I! 


s, that series may be extrapolated to a limiting | 
number that represents the energy of binding | 


ic last bound electron. In complex atoms, such 
hose under discussion, the number of excited 
es is relatively large in successive shells, and the 
sitions from high terms in series either give faint 
lines or are not observed. However, as long ago 
is 1894, Kayser and Runge [10] detected the first 
spectral series in Mn. That series, now interpreted 
as 2 °P°—n SS, permitted a calculation of the absolute 
value of 2*P°, and then by addition of the inter- 
system combination a°S—2z*P° the value of the 
term @°S representing the normal state of the atom 
was arrived at. A similar procedure is now possible 
with Te 1 because two multiplets (of three lines each) 


spectrum. 


| limit 


1. Tcll Multiplets 


Within the wavelength limits 2261.30 A> and 
4434.96 A about 800 spectral lines have been defi- 
nitely identified as belonging to singly ionized Te 
atoms, the Tem spectrum. The short-wave limit 
was imposed by the smallness of the Te sample and 
by the declining efficiency of photographic plates 
and spectrograph employed in recording the spark 
The paucity of enhanced Te 1 lines in 
the visible spectrum is responsible for the long-wave 
of observation. The measured wavelengths 
of Teu lines were converted to vacuum wave 
numbers with Kayser’s [7] Tabelle der Schwingungs- 
zahlen. 

Among the observed Tei lines there are two 
ultraviolet multiplets, each of three intense lines, 


| which were immediately recognized as the combina- 


| and 


have been found and interpreted as combinations of 


terms e*S and /°*S. 
nS, as well as the inter- 


A 


successive series 
= spe r 


‘P° with 
These combinations, 


system lines, a°S—2z*P°, are shown in table 3. 


Rydberg interpolation table applied to the series | 
lines vields the following approximate values for the 


*P° levels of Te 1: 
a » 
s} oie 
hy 


‘sp 
~ 2% 


42766 
43415 
43860 
Adding the wave numbers of the resonance lines, we 


obtain the following absolute values for the a°S., 
term: 


2 Fs. 


60290 em~! 


+(a"S.,,— 2 *P3..) =434154+ 16875 


, 
s Pix 


60289 em~™'. 


+- (aS... — 2 °P2,,) = 43860 + 16429 


Assuming that 1 electron volt 8067.5 cm™', this 
value of the deepest term in the Te 1 spectrum 
corresponds to an ionization potential of 7.47 electron 
volts. 

Experience tells us that extrapolation of a two- 
term Rydberg series generally leads to a slightly 
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tions a ‘Ss; (2647.02, 2610.00, 2543.24 A) 
a *S,- (3195.21, 3212.01, 3237.02 <A). 
In the Mnut spectrum a *S, is the ground state, and 
9473 cm” ' lower thana *S,. Assuming that a similar 
state of affairs could be expected in the Te 1 spee- 
trum, combinations between the quintet and septet 
terms were sought in order to fix their relative values. 
Fortunately, these intersystem combinations were 
unmistakeably represented by two outstanding lines 
(3892.14 and 3975.02 A), at the short-wave edge of 
the visible spectrum, and by the two most. prominent 
lines (2298.10 and 2285.47 A) very near the ultra- 
violet limit of observation. These Te u multiplets 
are presented in table 5, which is constructed like 
tables 1, 2, 3. The Tei terms revealed by these 
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Taste 5. Multiplets in the Te ui spectrum 
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31287. 8 


2208. 10(10) 
43500. 8 
2285. 47(3) 
43741.2 


z 5P3 
43500. 8 
2 5Ps 
43741.3 
: 5Pi 
43904. 9 





TABLE 6. 


Spectral terms of Mn and Te u 


Mou 


Term 
s\ mibol 


Electron 
configuration 


0. 
9472. § 
88806. 5 

38543. 

38366. 


370. 


484. ! 
We 


43: 
13 
43: 


multiplets are displayed in table 6 alongside the 
analogous terms in the Mn spectrum [8]. Atten- 
tion is called to the large separation (12,617 em~') 
between a*S,; and a°S,, and to the fact that the 
intervals in the 2°P° term are almost exactly in the 
ratio 3 to 2 as required by the Landé interval rule. 
In the Mn i spectrum the second metastable state 
is 3d°a°D, which combines with 3d°4p 2°P° to 
produce a prominent multiplet in the ultraviolet 
(3441.98 to 3497.53 A). 
Te u lines was sought with the aid of the inter- 
vals, 240.5 and 163.6 em~'. In view of the stronger 
binding of d-type electrons in the Te 1 spectrum, this 
‘Te 1 multiplet must be expected to lie farther in the 
ultraviolet. Failure to find it among the observed 
lines forced the conclusion that in Te mu the term 
a °D has lower energy than a *S, and that the multiplet 
under discussion will therefore have wavelengths 
outside the present limit of observation of Te spectra. 
Until this multiplet is observed it is impossible to give 
the correct values of Tc u terms in table 6 relative to 
the normal state a°D=0.0; this uncertainty is 
expressed by z. Further progress in the analysis 
and interpretation of the second spectrum of tech- 
netium would be greatly facilitated by extending the 


~ spe 


The analogous multiplet of | 


Mn iu 


Interval Level Interval 


0. 0- 


12617. 1-4 324444 
502 +4 
+ 240+4 
599 +r 


263. 5 39308. 


176.9 


| observations to both shorter and longer waves than 


the present limits. 
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Properties of Beryllium-Barium Titanate Dielectrics 
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Dielectrics having compositions in the system BeO-BaTiO;-TiO, were matured at 


1,240 
shrinkage. 


( 
25° C the dielectric constant 


to 1,525 


\t 


Data are given for the compositions, heat-treatments, absorption, and 


(kK 


and the reciprocal of the power factor (( 


were determined at frequencies of 50, 130, 1,000, and 20,000 kilocveles per second and at 


3,000 megacyeles per second when A is not greater than 50 
> and 
A and © were determined at temperatures between 25° and 200° (¢ 
pansion was measured over the temperature range 25 


was determined at temperatures bet ween 


0.77 percent 


decreased by a factor of 10° over a few days. 
changed with time 


I. Introduction 


vious studies, by several investigators, of 
ate dielectrics have shown that these ceramics 
iseful in the field of electronic instrumentation 
» Sj)! Special applications, such as in some 
ature electronic devices, necessitate dielectrics in 
pacitors that will function at elevated temperatures 
hout excessive electrical losses. One of the best 
nsulators at high temperatures is beryvllia, and some 
ceramies containing bervllia are characterized by low 
lectrieal losses [6]. Despite these facts and the in- 
eased interest In titanate ceramics during the past 
decade, very little information on bervilia-titania 
dicleetries has been published. 

The present paper, dealing with dielectrics com- 
posed of bervllia, baria, and titania, is the fourth in 
v series on ceramic dielectrics made from titania and 
the oxides of the alkaline earth elements. An ex- 
tended range in composition for the system BeOQ- 
BaQ-TiO, was covered in this investigation in order 
io reveal dielectrics that may have desirable proper- 
lies at various temperatures from —60° to +-200° C. 


iton 


71 


45, 


II. Preparation of Dielectrics and Methods 
of Test 


Dielectrics having the compositions shown in 
figure 1 were prepared from reagent quality barium 
carbonate and beryllia with the commercial grade of 
titania (grade TMQ) used in the preparation of 
alkaline-earth titanate dielectrics previously investi- 
gated [2, 4, 5]. 

The methods of preparing these dielectrics and of 
determining their properties have been previously 
deseribed [2, 4]. For measurements of A and @ at 
25° to 200° C, the silvered test disk rested on a thin 
flat silver electrode on a hot-plate, and a silver-wire 
electrode, in a vertical position, touched the top 
center of the test disk. These silver electrodes were 
connected to the terminals on the (-meter by short 


vures in brackets indicate the literature references at the end of this paper 


60 


The electrical resistivities of a few specimens were measured at 200 
a direct-current potential constantly applied for several days 


k 


15 


At | megacyele per second A 
C and at 130 kiloeyvecles per second, 
The linear thermal ex- 
It varied from 0.58 to 
(’, using 
In some cases the resistivits 
‘or specimens of some compositions, A and 


S85 


to 700° ©, 


lengths (12 in.) of heavy copper wire. ‘Temperatures 
of the disks on the hot-plate were measured with a 
calibrated copper-constantan thermocouple (B&S 
Gage No. 34) inserted into a small hole extending 
from the edge to the center of a similar disk placed 
next to the disk being measured. Due to the small 
thickness (0.1 in.) of the test specimen, the entire 
piece was maintained within a small temperature 
gradient. made on specimens of barium 
titanate by this procedure gave a Curie point between 
115° and 120° C, indicating an accuracy of about 

Bec, 

For electrical leakage tests on capacitors, matured 
specimens 0.75 in. square by 0.010 in. thick were 
coated over an area of 2 em? on each face with fired- 
on silver electrodes, and silver lead wires were 
attached. The specimens were heated in an elec- 
tric furnace, and the temperature was measured on 
a mercury-in-glass thermometer placed next to the 
test piece. Leakage current was measured on a 
microammeter, 


Tests 


III. Results and Discussion 


In table 1, data are given for the composition, 
heat-treatment, absorption, shrinkage, dielectric con- 
stant (A), and Q-value (reciprocal of the power 
factor) of mature specimens. The data for a given 
composition are considered to be the most representa- 
tive of those obtained from measurements of four to 
seven specimens. 

The effect of composition on the maturing tem- 
perature of the specimens is shown in figure 2, in 
which the specimens having the same maturing tem- 
perature are connected by lines. In all cases, the 
range in temperature within which mature specimens 
could be produced was not determined, but this 
range usually was not more than 25° C. Of the 33 
bodies with ternary compositions, 25 were matured 
at 1,275° C, or less, within 1 or 2 hours. Specimens 
designated BBe63 (table 1) required several heat 
treatments of 4- to 6-hours duration at 1,315° C to 
reach maturity. One or more relatively low quin- 





tuple points in the phase diagram for this system 
are indicated by the low-maturing temperatures 
over a wide range in composition. 

At 1 Me/’s and at 25° C, the variations in values 
of K and & with composition are shown in figures 
3 and 4, respectively. These diagrams were con- 
structed from the data on K and ( given in table 1. 
In the binary system beryllia-titania, with the sub- 
stitution of BeO for TiO, the values of K decreased 
from near 100 for TiO, to 17 for bodies of composi- 
tion 6BeO:TiO,. The observed value of K_ for 
BeO: TiO, was 57, compared to 71 reported by B. M. 
Wul {7). The higher values sometimes found are 
probably due to the presence of small amounts of 
the lower oxide of titanium, which has a dielectric 
constant of several thousand. (@-values remained 
at several thousand until about 50 weight percent 
of BeO was present. Higher percentages of BeO 
resulted in specimens with lower (@, which decreased 
to 130 for 6BeO:TiO,. The dielectric constant and 
(J-values for BeO (99.7°% pure), matured at 1,925° C 
and measured at 25° C and 100 ke/s, have been 
reported as A=6.3 and Q=420 [6]. 

Within the ternary system, for dielectrics contain- 
ing percentages of TiO, greater than about 60, the 
replacement of BaO by BeO did not change the 
values of AK very much, but usually increased the 
Q-values. However, on the join 8BeO:TiO,- 
BaO2Ti0,, K decreased from 200 for BaO:2TiO, to 
30 for 3BeO:TiO,, with a minimum of 25 about 
midway along the join. The @-values usually in- 
creased as the BeO content increased. On _ the 
4BeO:TiO0,-2BaO0 3TiO, join, K decreased rapidly 
from 900 for 2BaO:3TiO, to 23 for 4BeO:TiO,. The 
addition of 9 weight percent of 4BeO:TiO, to 
2BaO0:3TiO, decreased Q from 50 for 2BaO:3TiO, to 
33 for 2BBe49 (table 1), but greater percentages of 
4BeO:TiO, increased the Q to a maximum of 3,000 
at about 90 percent. With more than 90 percent, a 
decrease in Q to 300 for 4BeO:TiO, was found. On 
the 6BeO:TiO,-BaTiO, join, K decreased from about 
1,500 for BaTiO; to 17 for 6BeO:TIO,. Q-values 
decreased from about 100 for BaTiO, to 30 when the 
content of 6BeO:TiO, was 25 percent (BBe67, table 1). 
Further additions of 6BeO:TiO,, increased the Q to 
a maximum of 8,000 at 90 percent. However, at 
100 percent of 6BeO:TiO,, the Q had decreased to 130. 

The effect of variation in frequency on the Q-values 
may also be noted in table 1. In the large majority 
of specimens, the Q-values were lower at 50 ke/s 
than at the higher frequencies. At 20 Me/s, they 
were usually higher than at 1 Me/s for specimens of 
high titania content, but lower for those of high 
beryllia or barium titanate content. At 3,000 Me/s, 
higher values of Q were found than at 20 Me/s for 
about half of the specimens tested. 

In some miniaturized electronic equipment, the 
parts are subjected to temperatures of about 200° C, 
necessitating a knowledge of the properties of the 
dielectrics under these conditions. Data are given, in 
table 2, for K and Q measured at 130 ke/s at tempera- 
tures between 25° and 200° C, and for the average 
temperature coefficient of K. These data, in most 





16 


instances, were obtained on one specimen only of ex + 
composition. For most of these dielectrics, K 
decreased with increasing temperature, and | \e 
average temperature coefficient of AU ranged fr m 
35 to —810 ppm’? C. Q-values usually wi re 
decreased greatly at higher temperatures, and 4; 
200° C only seven specimens had Q greater than 5.0, 
with the highest (1,400) for 2BBe8. Only 10 of ty 
54 test pieces had a higher ( at 200° C than at 25° © 
and most of the 10 were of high barium titanate 
content. The “Q” meter was warm and uncer 
voltage control of +15 percent when the measure- 
ments were made on specimens having a capacitance 
of 100 to 200 wuf. Values of the average temperature 
coefficient of A are probably within +20 ppm/’° C 
The temperature of the test pieces was raised at the 
rate of about 3 deg C a minute, and readings were 
taken at 25 deg C intervals. In order to illustrate 
the variations in Q resulting from changes in tem- 
perature and composition, figures 5 and 6 were 
constructed for temperatures of 100° and 200° C, 
respectively, and are based on the data in table 2. 
In table 3 data are given for the values of K, at 
1 Me/s, over the temperature range of —60° to 
+85° C and for the average temperature coefficient 
of K. These values were obtained by measuring the 
capacitance of test disks at 10-deg intervals, with 
the temperature held constant at each interval for 
at least 15 min before measurements were made 
The average values of the temperature coefficient of 
kK, last pt of table 3, are considered to be not 
better than +10 ppm/° C, and most of them are 
negative within the range —40 to —840 ppm/° C. 


In order to illustrate the variation of A’ resulting 


from changes in temperature and composition, fig- 
ures 7, 8, and 9 were constructed for temperatures 
of —60°, + 100°, and + 200° C, respectively. These 
figures contain isodielectric-constant lines, based on 
the data in tables 2 and 3. 

After storage for 6 months under room conditions, 
two or more specimens of each composition were 
retested for K and Q at 25° C and 1 Me/s. The 
results for the dielectrics that showed a significant 
change are given in table 4. These changes, par- 
ticularly in A, are not as large, on the average, as 
those found to occur in the titanate systems pre- 
viously studied [2, 4, 5]. Specimens of only one 
composition (BBe69), of high BaTiO, content, de- 
creased in K and at the same time increased in Q by 
an appreciable amount. The absence of much solid 
solution between BeO and BaTiO, may account for 
the smaller change, with time, in the electrical 
properties of these dielectrics compared to those 
containing the other alkaline-earth oxides. 

Linear thermal expansions, determined by the 
interferometer method, were moderately high for 
representative specimens, as shown in table 5. 

In order to determine the usefulness of these 
dielectrics in capacitors at 200° C and under a con- 
stant potential, tests of the change in electrical 
resistivity over a period of time were made on some 
specimens of various compositions. On the assump- 
tion that practically all of the conductivity was 
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ch the piece, the approximate volume resistiv- 
is calculated from the amount of current and 
mensions of the test piece between the fired-on 
electrodes. The data obtained are shown in 

6 and figures 10 and 11. Barium titanate 
ens showed the fastest and largest decrease in 
vity, while the wafers of high bervllia content 

d the least change in resistance, After the 
only those specimens exhibiting a considerable 
ise In resistivity were found to be colored black 
rhout the volume between the silver electrodes, 
portions not under the clectrodes remained 
inged in color. This decrease in resistivity and 
ve in color is probably due to the reduction of 
etravalent titanium to the trivalent form. <A 
sal in the polarity of the applied voltage resulted 
temporary increase in resistivity, indicating a 
al reoxidation. The most variable resistivity 
i were obtained for specimens of barium titanate. 
of the factors most likely to influence the 
stivity is the number of dark-colored spots in the 
men before the test. The data on barium 
nate given in table 6 and figure 10 are the best 
ues for specimens made with a commercial grade 
titania. At 200° C and under 3-v direct current 


mil, the resistivity for BaTiO, is 1.0% 10° ohm- | 


when measured immediately after applying the 
For the other specimens given in table 6, 


oltage. 


the resistivity just after application of the voltage | 


is greater than 10" ohm-cm. In order to find how 
much voltage was needed to cause a decrease in 
resistivity with time at 200° C, the voltage on 
specimen BBe67 was increased to 100 (10 v/mil) after 
the specimen had been at 30 v, and 200° C, for 
2) days. The resistivity remained greater than 
10" ohm-em for 4 additional days and then de- 
creased to 210" ohm-em. 


The decrease in resistivity shown by these di- 


electrics, When subjected to a constant voltage, 
greatly lowers their usefulness for high-temperature 
duty. Those bodies, however, which showed a small 


loss in resistivity at 200° C could be used for the | 
production of capacitors for operation at this tem- | 


perature if the working voltage is not too high. 


IV. Summary 


Dielectrics having compositions in the system 


BeO-Ba TiO,-TiO, can be prepared from mixtures of | 


titania with barium carbonate and beryllia. Mature 
dielectrics (less than 0.1% absorption) were made by 
heating specimens to various temperatures within 
the range 1,240° to 1,525° C. 


The dielectric constant (A) varied from 16 for 
specimens with compositions near that of 6BeO:- 
TiQ,, to several hundred for dielectrics of high 
baria content. Most of the temperature coefficients 
of A were negative. The Q-values, at 25° C, ranged 
from 15 to 10,000 and were low for dielectrics con- 
taining large percentages of baria and for those with 
high beryllia content in the binary system BeO-TiOg. 
At higher temperatures, most of the Q-values de- 
creased greatly. A and Q-values of some of the 
dieleetries changed with time, 

Values for the linear thermal expansion over the 
temperature range 25° to 700°C, varied from 0.58 
to 0.77 percent and were obtained on dielectrics of 
widely different compositions. 

When subjected to an electrical potential at 200° ( 
for many hours, some of the dielectrics decreased 
greatly in electrical resistivity. 
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Ternary diagram for system BeO-BaTiO,-TiO, 
showing compositions studied. 


| B=BaO0; Be=BeO; T=TiOg; thus BefiT —4BeO: TIO, 
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Fieure 5. Q-values, at 100° C and 130 ke/s, with varying 
; composition within the ternary system BeO-BaTiO,-TiO,. 





BeO 8a0 


Figure 2. Approximate maturing temperature (| 
calcining treatment 


B= BaO; Be= BeO; T = TiOs:. 
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Ficure 3. Dielectric constant, at 25° C and 1 Mc/s, with | Figure 6. Q-ralues, at 200° C and 136 ke/s, with varying 


varying composition within the ternary system BeO-BaTiO,- composition within the ternary system BeO-BaTiO,-TiO, 
TiO,. 
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Ficure 7. Dielectric constant, at —60° C and 1 Me/s, with 
Ficure 4, Q-values, at 25° C and 1 Me/s, with varying com- varying composition within the ternary system BeO-BaTiO,- 
position within the ternary system BeO-BaTiO;-TiOd. TiO. 
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8. Dielectric constant, at 100° C and 130 ke's, with 
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APPROWIMATE VOLUME RESISTIVITY, ohm-crr 
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Figure 10. Approximate volume resistivity for specimens of 
BaTiO,;, TiO,, and 5BBe., at 200° C and 30 volts direct- 
current, after varying lengths of time. 


Thickness of all specimens Was approximately 0.010 in 
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Ficure 9% Dielectric constant, at 200° C and 130 ke/s, with | 
varying composition within the ternary system BeO-BaTiOs- 
TiO. 
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Ficure ll. Approrimate volume resistivity for specimens of 


5BBeo, 0.010 in. thick, at 200° C and 110 or 30 volts direct- 
current, after varying lengths of time. 


A, 110 v; B, 30 v; 1, polarity reversed; 2, original polarity restored 





TARLE 1. Composition, heat-treatment, absorption, shrinkage, dielectric constant K, and Q of bodies in the system 


6 BeO:TiOg- Ba TiOs-TiO: 


Heat treatment 
Dielectric constant, Reciprocal, Q, of power fact 
: 5 ; 25°C 


omposition weight K. at 25° 
Speci No.2 
men 
desig 
nation 


Proportion of end mem 
bers of join, weight 
percent 


Absorption 


BeO: 8TIO, Bad: IsTiO 


100 “0 
1.75 fs, 25 
oo moo 


Be 2TiIO, Bad: 12TIO 


ooo oo 
52.9 47.1 
22.2 77.8 


oo lane 
2BeO 8TiO, Bad YTiO 


woo 00 
a0 7.0 
wa a2 6 
00 moO 


BeO: Tidy Bal): 6TIO, 


Bel 100.0 0.0 
#BBe2 73.2 2s 
6BBeS 0 0 
6B Bes 12.6 av.4 
BTSs 0.0 100.0 


SReO 4TiIOn Bah: STIO‘ 


100.0 0.0 
76.6 z.4 
0 w.0 
0.7 so. 3 

00 100.0 


tBeO- 2TIO, Bad. 4TIO 


100.0 0. 

sO.0 » 
42.: 
wo 
100 


2BeO: TiO, BaO: 3TIOy 


100 
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TiO, Bao: 2? 





Com position, 


heat-treatment, absorption, shrinkage, dielectric constant K, and Q of bodies in the system 
6BeO: Til »-Ba Til 9- Til »— Continued, 


Heat treatment 
Composition weigh Dielectric constant, Reciprocal, Q, of power factor 
K, at 25° ¢ at 25° ¢ 
Proportion of end mem 
bers of join, weight 
percent 


ture 


Pemper 
Absorption 
Shrinkage 


iBeO: TiO, 2Ba0 


100 
~ 
70 
et) 


4 
0 loo 


TiO, Bad 





Tapie 2 Dielectric constant (K ()-values, and average temperature coe ficient of K from 25° to 200°C, at 130 ke’s 


A verag 
temperatt 
coefficier 


Specimen designa 


thon 
oO 


ppm 
PiOs on 


BeTS 
ISB Bet 
BT Is 


Ke T2 
12KBe4 
12B Be? 
BTI2 


Be2Ts 
OB Res 
OB Bes 
BTY 


Bel 
6B Be2 
ABRs 
HB Be 
BT6 


Best 

5BBe2 
SBReS 
SKBe 
Brs 

Bes T? 
iBBe2 
iWBes 
iBBey 
BT4 


Be2t 
{BRel 
tBBes 
t{BBet 
tHBes 
BT3 


Best 

2B Bel 
2B Res 
2BBesS 
2H Bes 
2BKee 
BT2 


Beat 

2BBe4! 

2B Bets 

2B Bets 

2B Bet? 

2B Bets 

B2Ts 7 


Bes T 16 

BRestl 16 

BBet3 17 

BRets . 4 

B Betz ine { 

B Bets 1, 190 1,32 
BT 1, 210 1,800 


» Maximum K value occurs between 100° and 125 





3 Dielectric constant, at 1 Me/s, from 60 5°C, and average temperature coeficient of dielectric constant 


ilues of A at Average 
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cient of A 


1, 160 





Dielectric constant, A’ (Quality factor, 4 


Specimen 


designation After 1 After 6 


A verege After 1 After 6 “hee Average 
day months Change 


Change 
change day months change 


ecimens with significant change in K 


Percen Percent Percent Percent 
2B Bey-2 5 +4 
. 2 5 A ‘ 0 


4 f 0 
2BRet? 7 ") 
! H4 
Kh 
2BBesw 
4 
4 
B Bet? 2 i ‘ : 
2 f 645 ; 2 ; PaBie 4. 
5 oo. t 
B Beso-2 1.080 Changes in K and Q of 
4 d 1.006 i specimens after 6 months storag 


4 rg 1. 0f0 ; 
s 1, 074 : : Measured at 1 Me's and 25° 


Specimens with significant change in Q 


iKBeS a0 
as fo 


1B Bet-1 a 000 
2 28. ‘ 0 


SBBeg 7 . oo 
A 200 
-B 3 7.2 140 


2B Bes 27 330 
-B : 1) 


2B Be5-1 25 f 000 
~A wn 
- » ae 


B Be43-1 ‘ Hoo 
A 1s) 


BBet5 270 
-A 


B Bets 3.2 
- 62.4 
-B 2.3 


* Apparent change not significant due to small capacitance of specimen 


lemperature range from 25 
Specimen designation 
1 200) 100 100) TOO 


Percent Percent Percent Percent Percent Percent Percent I ABLE 0. 


HBeEO- TiO» om 0.38 O48 

1BeO: TiO 5 F 

2ReO TiO. Pe = ro - Linear thermal expansion 
BeO- TiOs om tr) His 

SBBed OS 3 53 

BRetis O5 


a ee es 


43 M 


TABLE 6. Approximate volume resistivity of some specimens, 
Resistivity in ohm-em after 0.010 in. thick, measured at 200° C and ata potential of 30-1 
Specimen desig ; . » . » ; F 
nation direct current maintained for various periods of time 
| 1 he 1 day 2 days 5 days 21 days 
* In 2 hr after 2 days, the resistivity began to decrease rapidly and, withir 
BaTiO | 6x10 8x 108 8x 108 an additional 2 hr had decreased to 110° ohm-cm, when test was dis 
t , . 
TiOs 1 10° 2x >A xX 108 SS days 
.~ ; ‘ o-4 . Bee: © After 15 days 
a 
10 > 10 > 10 
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The refractivity of lithium fluoride was measured from 0.4 to 5.9 microns and expressed 
for 23.6° C by a four-constant dispersion formula by which an interpolation table was 
prepared for included wavelengths. The temperature coefficient was determined as 

1.63 x 10-5 (constant within + 0.05) for the visible region and a table of indices made for 
20° to 60° C at equal intervals of A from 0.4 to 0.7 micron. Most of the observed indices 
agree with the tabulated values within +3 10~* for the visible region and + 3 10-5 in the 
infrared. In calibrating an infrared spectrometer simple empirical relationships between 
drum reading, ), and wavelength are not entirely reliable. Three steps are recommended: 
(1) the expression of D in terms of FE, the emergent angle of energy from the prism; (2) 
expression of EF in terms of refractive index; and (3) the obtaining of the indices or their 
expression as a function of A. The first step requires at least two constants, one for angular 
equivalent of the screw and another for the particular Littrow-mirror orientation for D—0. 
The second requires knowledge of the refracting angle of the prism and its orientation with 


respect to the incident energy. 
length usually requires four constants. 


Lithium fluoride is preferable to calcium fluoride 
for use in prismatic form in the infrared to about 6u 
because of its much greater dispersion. Synthetic 
lithtum fluoride has been available for 20 years or 
more, and data on its indices of refraction have been 
published by Littman [1]? for the visible and to 3u; 
also by Hohls [2] to 3 and 4 decimals from 0.546 to 
\2u. The 4-deeimal indices by Gyulai [3] are chiefly 
for the ultraviolet region. 

Formerly, the making of lithium fluoride was car- 
ried out completely in an atmosphere of air, whereas 
more recently a vacuum technique has been used, 
with a concomitant change in some of the optical, 
mechanical, and thermal properties of the product [4]. 
Thus some question has arisen regarding possible dif- 
ferences in the refractivity of various specimens of 
this material. No final answer should be made at 
present, but it can be said that only small variations 
in the fifth decimal of refractive index have been 
found at this Bureau among six or more prisms of 
synthetic lithium fluoride that have been carefully 
measured in the visible region, including one that 
has been on hand for about 12 years and used as a 
secondary standard. 

On this NBS standard prism refractive-index 
measurements were made in the visible region for 
five wavelengths in four groups of experiments at 
temperatures that averaged 28.6°, 31.6°, 52.6°, and 
8.9" C. On two other prisms, at seven wavelengths, 
dispersion measurements were made at temperatures 
near 25° © and represented by a four-constant 
Ketteler-Helmholtz dispersion formula with the 
parameters adjusted by least squares. These 
prisms were lower in index than the above mentioned 


e Work described in this paper Was carried out in part under the sponsor- 
the U. 8S. Air Force at the National Bureau of Standards A report 
Was presented on October 27, 199 at the Cleveland meetine of the 
in Optical Society 
ires in brackets indicate the literature references at the end of this paper 


The adequate expression of index as a function of wave- 


standard by only 1.4 10°°, and no differences in dis- 
persion could be detected. From these data, by the 
process of graduation, table 1 of the average refrac- 
tivity of lithium fluoride was prepared at 5-deg 
intervals from 20° to 60° C at intervals of 100 A in 
wavelength from 4000 to 7000 A, 

These data in the visible region vield tentative 
values for the temperature coefficient of refractive 
index. The average found for data on five wave- 
lengths and for a very small temperature interval 
near 30° C is —16.4%10°° per 1° C, and similarly 
for a small temperature interval near 56° the average 
is —17.510°°. Some of the same data yield an 
average value of 16.3%10°° for the interval 
31.6° to 52.6° C. For red light the values are some- 
what larger than for violet, perhaps 5 percent larger. 
These values agree well with the coefficient of 

16.67 10°° published by Littman, and are plotted 
in figure 1 for comparison with similar values re- 
ported by Radhakrishnan {5}. 

The coefficients in the right-hand columns of table 
1 were obtained from differences smoothed in such 
manner that the results approximate average values 
for the derivatives. The change in index with wave- 
length applies almost equally well at all listed tem- 
peratures. The change with temperature is more 
nearly valid at the mid temperature of 40° ©. 

The refractive-index measurements herein reported 
for the infrared region were made on a prism with 
faces about 6 by 8 em and a refracting angle, «1, of 
72°2'24”’ as ground and polished by the Perkin- 
Elmer Corp. of Glenbrook, Conn. The first: work 
was done on a minimum-deviation spectrometer 
for several wavelengths in the visible region in order 
to compare its refractivity with that of lithium 
fluoride prisms that had been previously measured. 
These index values proved to be intermediate with 
respect to, and closely an average of, those previously 
determined for other specimens. ‘They are expressed 
with approximately sixth-decimal-place precision in 
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Tape 1. Refractivity of lithuim fluoride at various temperatures in the visible region 


O0.00471453 


[ The column for 25° C was computed as n?=1.9261590—0.00705034X2+ —0.0004731 J 
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the visible for a temperature of 23.6° C by the 
formula 


0.0047 1433 
P .926223 0.00705034\? , 
w= 1 .Saesasl (05034N + y2 9 0094731 


where, as usual in such equations, \ is to be expressed 
in microns. 
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This formula was used in computing the index 
1.39102 of this prism for the cadmium red line, 
\=6438 A, which is the basic reference point in the 
index measurements for the infrared region. Also, to 
insure that the slope of the dispersion curve would be 
well controlled at the short-wavelength end, the 
indices computed by this formula were used as 
“observed” values for seven other wavelengths in the 
visible region. 

The method used for index measurements is the 
same as that already described for measurements on Ficure 1. Temperature coefficient of refractivity of lithium 
crystals of thallium bromide-iodide [6] and on a fluoride (averages for visible region). 
crystal of silver chloride {7}. That is, briefly, the For red light the values are somewhat larger than for violet, perhaps 5 percen! 
prism was installed with the aid of an auxiliary prism | “8° *: SBS: 0, bittman (mo) eens 
at an angle of known incidence on an infrared spec- 
trometer equipped with a Littrow mirror; a thermo- 
couple was used and a measure of the received energy 
was recorded on a Speedomax potentiometer. The 
incidence angle, i, for which the prism was installed 
was 54°51'28"’, which corresponds fairly well to 
(A+ D)/2 at minimum deviation for the cadmium red 
line. By ray tracing, one computes an angle of emer- 
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the upper curve was obtained with automatic slit control for constant radiant energy 


rane 


we, By, of 5495455” for the initial conditions at 
438 A. 
The angular increments for the Littrow mirror, 
between the initially and the subsequently recorded 
energy Maxima and minima, when applied to the 


use in 


initial A, give approximate values of /\ for 
computing indices by the equations 


snk 


sin 1 sin 7 


cot 7’=cot A+ 


n=sin 7(1+-cot?7’) 

If the entrance and exit slits of the infrared spec- 
trometer subtend an angle at the collimating mirror, 
it is Necessary to make appropriate correction for 
the fact that one-half of the corresponding angle at 
the face of the Littrow mirror is not constant as 
wavelength varies. The index-of-refraction meas- 
urements were made in the infrared region by the 
of a Perkin-Elmer infrared spectrometer. The 
prism remains stationary, and different regions of 
the spectrum are obtained by rotation of a Littrow 
mirror. A new-type thermocouple, developed by 
the Perkin-Elmer Corp., was used as the detector. 
lt was used with both continuous and modulated 
radiant energy. Final drum readings, however, 
were obtained by slow manual control of the drum 
near the maxima of deflections, in order to avoid 
errors caused by small time lags in the response of 
the pen to changes in energy at the detector. 

The wavelengths used in the index measurements 
in the visible and near infrared region were obtained 
by the use of the radiant energy from an FH-4 
mereury-cadmium lamp. Several lines from this 
source in the visible and near infrared region are 

ficiently intense that they may serve as precise 
st endian for determining the relationship of the 
Littrow mirror angle and wavelength. In table 2 


use 
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Infrared absorption spectrum of water vapor in the 5- to 6-~ region as measured with a lithium fluoride prism 


The lower curve was recorded with a constant slit 


are listed 15 lines of the speetrum of Hg and Cd that 
have been used for the measurement of the indices of 
refraction of LiF. 

In the region from 2.7 to 6 w nine absorption bands 
of methanol, polystyrene, carbon dioxide, and water 
vapor were used for the standard wavelengths. The 
wavelengths of the two absorption bands of methanol 
were taken from the work of Borden and Barker |S}. 
The values of indices of refraction experimentally 
determined by the use of 2.7144- and 4.866- absorp- 
tions of methanol differ by +7 107° and —9 10 
from the adjusted values obtained by the use of the 
empirical equation (2). There may be a small error 
in the grating determinations of these two wave- 
lengths. 

The atmospheric water vapor absorption spectrum 
was recorded from 4.9 to 6.2 up. The wavelengths of 
the lines were selected from the measurements of 
Plyler and Sleator {9} and H. H. Nielsen [10]. While 
there is some doubt as to the correctness of the wave- 
lengths to the fourth decimal place, the accuracy is 
sufficient for the determination of these indices of 
refraction of LiF in the 5- to 6-u region. In figure 2 
there is shown a recording of the water vapor spee- 
trum when medium amplification is used. 

The upper curve was obtained when the automatic 
slit control was in operation, thus making it possible 
to hold the radiant energy almost constant. Because 
of the absorption of the prism material and_ its 
increase in dispersion with wavelength, it was neces- 
sary to increase the slit width by a factor of 3 in 
going from 5.50 to 6.24. The lower curve represents 
the recording of the water vapor spectrum with 
constant slit. The wavelengths of some of the lines 
are marked on the graph. ‘The numerical values are 
expressed in microns and are given to the third 
decimal place. 

The refractive indices as computed from the 
observed data by eq (1) were adjusted by least squares 
through determination of the parameters of a four- 
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TABLE 2. Observed and computed data on index of refraction 


of LiF (23.6° C) 


Drum reading in 

turns < 100 
Wave- 
length 


Source or absorbing 
medium 


Computed 
inde. 
Com- 


Observe 
bserved puted 


+2k 1 
0 
0 


2245.6 
2224. 0 
2191.1 
285.4 
aM2.0 


yellow (mean 


He 


He 
He 
He 
He 
He 


CHLOH 
Polystyrene 
do 
do (center 


cr 
CH)OH 


1uNo. 9 
1917.9 
1s00.0 
In. 2 
1764. 6 


lite. 3 
1473. ¢ 
1365.9 
ws. 2 


a0 fsl.4 


HeO vapor 5 
. do 
do 
do 


$2277 515 Sl 
s1392 $i: 
sO408, 104 
SOON) +3 2.5 7 


5 
5 

; 
1 


58044 


* These larger residuals, +7 and —9X 10-5, both for the wavelengths defined 
by absorption of methanol, would vanish if the wavelengths were in error by 
+0.0056 and —0.0086 w, respectively 


constant Kettler-Helmholtz dispersion formula as 
limited to one infrared and one ultraviolet Sellmeier 
term. The result is 


0.00492029 


4091.74975 
d\?— 0.0053 160" 


n?=7.0537595 —~ 
. , 797. 8925296 


> (<) 
ts 
which is preferable in computations to its equivalent 
form 

0.92556295 


\?— 0.0053 160- 


51281966 
~ 797.8925296 — 

The agreement between the indices computed by 
this dispersion formula and those observed — is 
excellent, as shown by the residuals tabulated in 
table 2 except for the two instances in which methanol 
was used. We think that the wavelengths for these 
two observations are slightly inaccurate and suggest 
that 2.7088 and 4.870 uw are better values than those 
listed in table 2 and used in computing the index data. 

Figure 3 shows the results of this work on an 
infrared spectrometer compared with previously 
reported values of the index of lithium fluoride, the 
present observed values being represented by circles 
and the computed values by the line An=0. Table 
3 expresses these computed values in a form for 
convenient interpolation at any value of \ from 0.5 
to 6.0y. 

The precise fitting of these data by formula (2) is, 
we think, the first accurate representation of index 
of a solid by four constants over a comparable 
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Figure 3. Comparison of observed (circles) and computed 
(An=0) indices of refraction of lithium fluoride with pre- 
viously reported values. 

The indices computed at N BS by formula (2) of this paper have been subtracted 
from all others. No allowances for effect of temperature have been made, but 
differences caused by temperature are less than 10x 10-5 Recent values of index 
by Durie (J. Opt. Soe. Am. 4, 879 (1950!) and values that he quotes by Kessler 
are in good agreement with NBS values. ne, Gyulai; a”, Hohls; az, Littma 


no, observed at N BS; ne, computed at N BS; ax, Radhakrishnan. 


spectral range (the absolute term of formula (2) 
being 1 plus functions of the four constants). The 
five-constant form 


(4) 


where & is independent of the other parameters, 
and also the five-constant form 


nr a*—K pm +e 


(which latter we successfully used for KRS-5 and 
for Ag Cl) were each tried (in simultaneous solutions 
for lithium fluoride, but neither seemed more 
suitable than the four-constant formula with only 
the two Sellmeier terms. However, this comparison 
of formulas is not definite because a least squares 
adjustment was made only on the one last named. 

For KRS-5 it seems that the inferiority of the 
four-constant formula may be said to occur because 
one fictitious infrared band cannot represent the 
band near 117 and also the other near 152y as well 
as can be done by three terms in the expansion of 
each of their respective Sellmeier terms. On_ the 
other hand, for lithium fluoride, the band near 16. 
is so comparatively unimportant that its effect and 
also that of the band near 23u can be effectively 
represented as one. That the effective location 
(28 yw) of this one infrared band is not in this case 
intermediate between the actual locations mentione: 
is of no particular import. These dispersion equa- 
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have an approximately theoretical basis, but 
on is necessary in quantitative reasoning based 
cir use. Whenever parameters are determined 
urely statistical methods their values must to 
extent perform functions other than those 
butable to their particular simple interpretations 
the formula in every way theoretically 
juate. 
Che advantages of using an equation rather than a 
phical method in the calibration of an infrared 
clrometer are pointed out by MeKinney and 
edel [11], but such procedure cannot be relied 
for accuracy [12] unless the equation used 
theoretically applicable or thoroughly tested 
throughout the range of use. 
in special cases, particularly for short intervals of 
wavelength, sufficiently remote from the resonance 
sions, one may find comparatively simple re- 
ationships between drum readings, D, and the 
wavelength, A, but in general with any spectrometer 
and prism the safe procedure is (1) to express D in 
terms of the emergent angle, F, and (2) to find the 
proper relation between /\ and the corresponding 
refractive index, nm. 
For the spectrometer used in this investigation, the 
drum readings are nearly proportional to changes in 
emergent angle, and one may write 


ss Is 


Is 


is 


I 
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D=(E—e)/s, (6) 
where e is the particular value of & corresponding te 
D0, and s is an average angular equivalent of 
the serew. 

The values of F for use in computing D for any 
prism of known refracting angle A are obtainable 
from eq 1, which may be written 

sin K=sin A (n?—sin® i)'?—cos A sin /, (7) 
\ =e i, the angle of incidence, will be known by the 
nner of installation (sin i=n sin A/2 if set for 
iinimum deviation) and values of the index, n, 
must be obtained from a table or computed from an 

ppropriate equation expressing n as a function of X. 

For this spectrometer and our installation of this 
prism, after writing equations 
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for all our observations and adjusting by the method 
of averages, we find 0.444259° and 45.1802° 
if the unit of is 0.01 turn. For our prism, 

sin £=0.9512720 (n?—0.6686737)'*0.2521473, (9) 
and we have by these means obtained the computed 
values of Das listed in table 2. 

In figure 4, where the observed and computed 
values of )) are compared, it will be noticed that the 
observed values of J) are systematically low by about 
0.01 turn of the drum, between 1) readings of 440 
and 1980 (turns 100) and of opposite sign else- 
where. These slight differences are a result of two 
causes. First, the angular equivalent of the serew 
is only an average value. A calibration of the screw 
was used in the precise reduction of the observed 
data but is ignored in this subsequent proposal for 
computation of drum readings. Second, the small 
angular subtense of the entrant and emergent slits 
is also ignored in this computation of « lrum re vadings 
from the adjusted observed data. 

Consequently, if the slits are in near angular 
coincidence, and if the serew advance (reasonably 
free from periodic error) is nearly proportional to 
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The systematic difference occurs chiefly because the relation between drum 
reading and angular orientation of the mirror is not strictly linear An error of 
one hundredth turn is approximately equivalent to an error of 4 10-5 in refractive 
index. (The two circles represent data based on the absorption bands of meth 
anol for which there may be small errors in the reported values of Wavelength 
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angle of orientation of the mirror, then a satisfactory 
calibration of the drum readings in terms of wave- 
length can be made, provided the drum readings are 
known for some 10 or 15 wavelengths for which the 
indices are reliably known or obtainable. In fact, 
if the drum readings can be observed with sufficient 
accuracy (higher than that needed in subsequent use 
of the spectrometer) then four or five wavelengths 
will in most cases be sufficient for the calibration 
(but with no remaining data to confirm the correct- 
ness of procedures). 

For precision and reliability in expressing re- 
fractive index as a funetion of wavelength the 
Sellmeier dispersion equation 


Ky? 


0 
\2—»? (10) 
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and its variants are unexcelled. It is best known in 
its slightly expanded form in which it is exactly 
equivalent to the Ketteler-Helmholtz equation 
having only four disposable parameters. For use 
in the most frequently encountered conditions, where 
the ultraviolet critical frequencies are proximate or 
more important than those in the infrared, one 
further expands the infrared term and writes 


m 
n?=a?—ky?— po ——~ a0 (11) 
: PNT )\2_p 
in which the term in A* can often be omitted. Simi- 
larly, for the opposite case, where the infrared bands 
are near and predominant, one uses 


M 


(12) 


" be 
eT ee 


but sometimes when both the ultraviolet and infrared 
effects are important it is advantageous, from the 
standpoint of high precision and accuracy, to keep 
the fractional remainders of both Sellmeier terms 


M - 

Powter wn 

which may be only a four-constant formula (a? being 

a function of L?, P, M, and m) or may be used with 
a as an independent parameter. 

The shorter the interval of wavelength, the more 
remote the critical frequencies, and the less precise 
the data, the greater are the chances for successfully 
using the fully expanded forms and those simpler 
forms written with the first power of n, such as 


n=a+bN+ 4+ ++ (14) 


and the numerous empirical variants that are » 
and then proposed, tested, and found satisfact 
under one or more of the three limitations | 
stated. 

In selecting the best variant of these dispers 
equations for use in any given instance, and in 
ciding how many constants should be used, it js 
usually advantageous to make simple simultaneo is 
solutions before resorting to least squares. In su) 
solutions it is advantageous to solve by systema je 
elimination and arrange the work so that the solution 
for e—1 constants can be made with very little addi- 
tional work after a solution for ¢ constants.’ Like- 
wise, in arranging the work for least squares soly- 
tions, one can so manage that most of the sums of 
the squares and cross products for the ¢—1 adjust- 
ment are obtained from among those computed for 
the ec-constant adjustment, and then the norma! 
equations can be so written and solved that little 
work will thereafter be necessary in obtaining the 
adjusted solution for e—1 constants. 

Much of the effort made to substitute simplified 
dispersion equations for those of the Sellmeier type 
stems from the nonlinearity of the constants when 
Sellmeier terms are present. An objection is that 
direct least squares solutions are not possible, and 
the differential process of betterment of some 
approximate solution must be used. Much of this 
objection vanishes whenever it is advisable to make 
preliminary simultaneous solutions in the process of 
selection of a proper equation. 
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For example, the observational equation that is not to be used in the second 
solution should be written either first or last, but not among, the other ¢—1 obser 
vational equations, and one eliminates last, or solves first, for the constant not 
to be used later 





h of the Nation 


{ Resear 


Preparation and Purification of Hydrogen Deuteride 


Abraham Fookson, Philip Pomerantz, and Fdwin H. Rich 


In an investigation of the physical properties of hydrogen isotopes it Was necessary to 


prepare 10 liters of high-purity hydrogen deuteride. 
aluminum hydride with deuterium oxide, giving hydrogen deuteride, in two batches 


98.1- and 97.0-percent purity, respectively. 


This was done by reacting lithium 
of 
his material was then fractionated at liquid 


hydrogen temperature in three batches, vielding the desired amount of hydrogen deuteride 


in a purity of 99.8 percent. 


I. Introduction 


onnection with a project on the measurement 
cise physical properties of hydrogen isotopes, 
; found necessary to prepare about 10 liters of 
purity hydrogen deuteride. This substance 
been prepared for the first time by Scott and 
owedde [2, 6] ' by passing a mixture of hydrogen 
deuterium over a heated metal filament, fol- 
wed by fractionation at liquid hydrogen tempera- 
The equilibrium mixture after conversion 
contained about 50 mole percent hydrogen deuteride, 
as would be anticipated from the equation 


] 
i) 


H,+ D.@2HD, 
for which the equilibrium constant is about 4 [5]. 

\lixtures containing a high percentage of hyvdro- 
ven deuteride have also been prepared by the action 
of deutertum oxide or water on various hydrides or 
deuterides, respeetively. These methods give, in 
general, higher vields of hydrogen deuteride than is 
btainable by equilibrating hydrogen and deuterium 
lhus, Norton [5] treated boron hydride, B,H,, with 
deuterium gas to obtain the deuterated product, 
3D, The latter, when treated with 83-percent 
sulfurie acid, reacted with the water present to 
vield a mixture of hydrogen isotopes containing 85- 
percent hydrogen deuteride. 

Similarly, lithium hydride with deuterium oxide 
has been used to prepare enriched hydrogen deu- 
teride [1]. With sodium hydride, 87-percent hydro- 
ven deuteride is obtainable [7], and with lithium 
aluminum hydride and deuterium oxide at 0° C, 99- 
percent hydrogen deuteride has been prepared {7}. 

For the purpose at hand, which requires hydrogen 
deuteride containing no more than several tenths 
percent of impurities, the last-mentioned reaction 
gested itself as the most appropriate. In addi- 
tion, to further purify the material obtained from 
this reaction, the fractionation technique of Scott 
and Brickwedde was employed. 


II. Apparatus 


lhe “crude” hydrogen deuteride was prepared in 
apparatus described by Dibeler [4], shown in 


ures in brackets indicate the literature references at the end of this paper 
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figure 1, except for the method emploved for adding 
deuterium oxide. This reagent was added by means 
of a hypodermic svringe through a neoprene septum 
veld in a fitting attached to the tapered joint, ©. 
It was found that the septum could be punctured 
several times without danger of admitting air to the 
system. This rendered it possible to add the deu- 
terium oxide in several portions, thereby diminish- 
ing the vigor of the reaction. 

In addition to the method for adding deuterium 
oxide, the apparatus of Dibeler was further modified 
by the inclusion of a magnetic stirrer in the flask, 1); 
this was operated by a rotating magnetic field out- 
side the flask. 

The crude hydrogen deuteride was distilled in the 
apparatus shown in figure 2. This is, with minor 
modifications, the apparatus used by Scott and Brick- 
wedde [2, 6] in their work; this technique has not 
heretofore been published’ The still, .1, consisted 
of a jacketed boiler of about 5-ml capacity and monel 
helix rectifving section, B, a re-entrant portion, ©, 
which served as a cold-finger dephlegmating con- 
denser, and a take-off tube, /, which was connected 
to the top of the still by means of a U-tube, J. The 
liquid was boiled by means of an electric heater, F, 
of constantan wire, the leads of which ran up the 
still and emerged through a vacuum-tight wax seal, 
G. The still was immersed in a Dewar flask of liquid 
hydrogen, the level of which was maintained at 
about the position indicated in the drawing. This 
Dewar was in turn immersed in another Dewar con- 
taining Jiquid nitrogen (not shown in the drawing). 

The still was connected through a ground glass 
joint to a regulator valve, //, thence to a graduated 
Toepler pump, J, of 500-ml capacity. The regulator 
valve consisted of a length of thin drill rod brazed 
to an iron core. The drill rod fit with little clearance 
into a length of heavy-walled capillary tubing, and 
was movable within this capillary by means of the 
solenoid, AK. By thereby varving the length of 
annular space between the drill rod and tube wall, 
the rate of flow of gas from the still to the Toepler 
pump was easily controlled. The pressure in the 
still was indicated by a manometer; by adjusting 
the levels in the Toepler pump by means of the mer- 
cury reservoir, \/, to a pressure less than that shown 
on the manometer, gas could be caused to flow from 
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deuteride. 


This drawing is identical with that reproduced in V. H 


Figure |. Apparatus 


Dibeler’s paper (4). 


the still to the Toepler pump at a rate determined 
by the pressure difference and the setting of the 
regulator valve. 

The Toepler pump was connected to a manifold, 
N, and from there to a mercury diffusion pump. 
The manifold was fitted with a number of ground 
glass joints, to which sample tubes, receiver bulbs, or 
bulbs containing charging stock could be attached. 
The general practice was to use the charging bulbs 
as receivers, after the material they contained had 
been added to the still and the residual gas removed 
by the vacuum pump. 


III. Preparation of Hydrogen Deuteride 


Since 10 liters of high-purity hydrogen deuteride 
were required, this could readily be obtained by 
distillation from 15 liters of crude prepared by the 
method of Wender, Friedel, and Orchin [7] (lithium 
aluminum hydride and deuterium oxide at 0° C). 
Generous allowance could then be made for losses 
incurred during handling and distillation. 

Accordingly, 15 liters of hydrogen deutride were 
prepared in two runs, of 5 and 10 liters, respectively, 
of 98.1- and 97.0 percent purity. Since both batches 
were prepared in the same manner, only one will be 
described. 

Into the Claisen flask, D (letters in this section 
refer to fig. 1), there was distilled about 150 ml of 
n-butyl ether, which had been previously dried and 
distilled from sodium. This was followed by 5.75 g 


of lithium aluminum hydride, the addition of this | 


reagent being carried out in a nitrogen atmosphere. 
This quantity of lithium aluminum hydride was 40 
percent in excess of that required to prepare 10 
liters of gas, as calculated from the equation 


LiAlH,+4D,0--LiOD + AL(OD), +4HD. 


The rubber septum-retaining device was fitted ‘9 
the flask, at (, and the latter attached to the e »- 
denser, £. 

The reaction mixture was frozen by means o! 4 
liquid nitrogen bath, the system evacuated, and tie 
reaction mixture heated until it boiled under its owy 
vapor pressure. After 1!, hours, the flask was ago in 
cooled with liquid nitrogen, and the system re-evacii- 
ated. By means of a hypodermic syringe, 5 1) 
of deuterium oxide (99.5-percent purity) was added 
to the solid reaction mixture. The latter was per- 
mitted to melt, and the gas evolved as a consequence 
of the ensuing reaction was collected in two 5-liter 
collecting bulbs attached to a manifold at J. During 
this operation, the mixture was being stirred continu- 
ously. Due to its low temperature, the outside of 
the flask quickly accumulated a laver of frost 
Whenever this began to melt, the liquid nitrogen bath 
was applied; in this manner, the reaction temperature 
was kept at 0° C or lower. 

During the course of the reaction, deuterium oxide 
was added two more times, to make a total of 18 ml 
of this reagent added. This quantity was about 
150 percent excess over that required to make 10 
liters of gas as calculated from the above equation. 
It was found desirable to use this excesss in order to 
prevent the reaction from becoming too sluggish as 
the reagents were consumed. 

After each run was completed, a sample was with- 
drawn from the collecting bulb by means of the 
Toepler pump, and added to an evacuated sample 
tube. These samples were subsequently analyzed 
mass spectrographically to determine purity of the 
hydrogen deuteride. This was found to be 98.1 
and 97.0 percent, respectively, for the two batches 
prepared. 


IV. Distillation of Hydrogen Deuteride 


Since about 935 ml of hydrogen deuteride gas at 
20° C is equivalent to 1 ml of liquid at the boiling 
point of hydrogen, 20.34° K, one 5-liter bulbfull 
of the crude as prepared was used as a convenient 
charge for each distillation. The entire 15 liters 
was therefore distilled in three batches, of which only 
one will be deseribed in detail. 

The collecting bulb, S (letters in this section refer 
to fig. 2) containing the charge at atmospheric 
pressure was attached to the manifold, as shown, and 
the entire system evacuated by means of a mercury 
diffusion pump. The Dewar flasks surrounding the 
still were filled with liquid hydrogen and _ liquid 
nitrogen, stopcock FP was closed, and stopeocks 
UO, P, and Q opened; the regulator valve, /7, was 
also opened wide by means of solenoid K. The 
contents of S were thereby permitted to flow into 
the still, where condensation occurred on the con- 
denser surface, C. After about 1 hour, the jacketed 
portions of the still had cooled sufficiently to permit 
liquid to begin accumulating in the boiler. The 
Toepler pump was then used to withdraw the remain- 
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Apparatus used for purification of hudroge n 


deuteride hy fractional distillation 


vas from S and to force it into the still; this last 
operation was continued until the pressure in the 
ih as read on the manometer (stopcock Q closed), 
was 50 mm. It was not considered practical, with 
regard to the amount pumped for the time con- 
sumed, to attempt to reduce the pressure any lower. 

stopcock P was then closed, and a current of 71 
ma permitted to flow through the heater, F. This 
current. was controlled by means of a variable 
transformer; since the heater had a resistance of 
27 ohms, it was equivalent to a heat input of 0.136 
watt, or 0.083 calsec. The charge was thus caused 
to boil under total reflux, and, as the hydrogen 
present in the still was concentrated in the head as a 
consequence of fractionation, the pressure indicated 
on the manometer rose from an initial value of 
47 to 531 mm. This rise occurred in about 45 min 
and did not increase thereafter. (Had there been 
sufficient hydrogen in the charge, the pressure at 
equilibrium should have been about atmospheric, 
or somewhat greater due to the fact that heat was 
being introduced.) 

During this time, stopeocks 2, 1°, and O were open, 
evacuating the Toepler pump and the bulb, S, 
for use as a receiver. Other, smaller, bulbs were also 
on the manifold, to be used as forerun and residue 
receivers; these are not shown in figure 2. 

When equilibrium had been reached, as indicated 
by constaney of pressure, the regulator valve was 
adjusted to about half-open, stopcock O was closed, 
and stopcock ? opened. The mercury levels in J and 
\f were adjusted so that the pressure in the Toepler 
pump was about 160 mm less than that in the still. 
Take-off of distillate was thereby begun. 

Each time the bulb J filled, stopcock P was closed, 
stopcock O was opened, and the contents of the 
Toepler pump were discharged into the appropriate 
receiver. The points at which fractions were to be 
cut were determined by the pressure indicated on 
the manometer. This pressure was approximately 
the equilibrium pressure of the distillate at the con- 
denser temperature. Its absolute value was of little 
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Picture 3. Distillation curve of hydrogen deuteride 


concern; What was of interest during the distillation 
Was its variation, As can be seen from figure 3, 
the pressure-volume distilled curve for this distilla- 
tion, the bulk of the distillate was removed at 478 to 
480 mm. The pressure at any time was influenced 
by the level, &, of liquid hydrogen, the take-off rate 
at that moment, the ambient temperature, and the 
rate of boiling. 

During the course of the distillation it was neces- 
sary to adjust the regulator valve several times and 
to determine the proper pressure in the Toepler 
pump, in order to achieve the desired reflux ratio. 
The heat of vaporization of hydrogen deuteride is 
257 cal mole”! at its boiling point, 22.54° K [s] 
Since the heat input was 0.033 cal see~', the boil-up 
rate was 185 ml min”, at 20° C and 760 mm. The 
take-off rate was observed by noting the amount 
withdrawn in the Toepler pump, and the time taken 
to withdraw it. From these figures, the reflux ratio 
was calculated. Because of the nature of the adjust- 
ments necessary to the operation of the still (as the 
mereury level in J fell, the leveling bulb, AZ, had to 
be periodically lowered to maintain approximately 
the same head), the reflux ratio was variable. Dur- 
ing the withdrawal of the pure hydrogen deuteride 
fraction, this quantity ranged from 9.3:1 to 26.5:1, 
with an average of 13.7:1. 

In the distillation described above, the charge was 
equivalent to 4,600 ml of gas, at 20° and | atm, 
There was distilled 3,580 ml, of which 390 ml was 
forerun, 2,850 ml pure hydrogen deuteride, and the 
rest holdup. The volumes of the fractions were 
obtained by summing the volumes of gas discharged 
from the Toepler pump and correcting to 20° C and 
1 atm; these values are plotted on the distillation 
curve, figure 3. Upon completion of the distillation, 
the pressure in the bulb containing the pure hydrogen 
deuteride was measured on the manometer. The 
corrected volume as calculated from this figure was 
3,080 ml. It is believed that this value is more 
reliable than that obtained by summing the Toepler 
pump fractions, since the latter undoubtedly contain 
cumulative errors. 

The distillation was continued until the boiler 
and the lower coils of the fractionating section were 
dry. This left a holdup of about 1 liter of gas, since 
with such a small amount of material fractionation 
was impossible in this still. The forerun and holdup 
of the distillation were discarded. 





A sample of the main fraction was analyzed by 
means of the mass spectrometer, and was shown to 
have a purity of 99.S8-pereent hydrogen deuteride, 
disregarding a small percentage of nitrogen. The 
remainder of the erude hydrogen deuteride was 
distilled in the manner deseribed herein, and yielded 
material of about the same purity 


—__—___—_ 
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Linear Thermal Expansion of Artificial Graphites 
to 1,370° C' 


Milton D. Burdick, Benson Zweig, and R. Eugene Moreland 


In connection with creep studies of graphite, the need arose for accurate thermal ex 


pansion data on the three grades of graphite being investigated 


Therefore, the linear 


thermal expansion of specimens of the three grades, cut in three directions with respeet to 


the direction of extrusion or of molding pressure 
Results show that specimens cut perpendicular to the 


between 20° and 1.370° C in vacuum. 


extrusion direction or parallel to the direction of molding 


Was determined at various temperatures 


ywressure had greater expansions 


than specimens cut parallel to the extrusion direction or perpendicular to the molding pres- 


sure. The data obtained are tabulated. 


I. Introduction 


connection With creep studies of graphite, the 

arose for accurate thermal expansion data 
within the range from 980° to 1,370° C on the 
particular graphites being studied. The data were 
necded primarily to correct for the thermal ex- 
pinson that would result from changing the test 
temperature during elongation or creep tests of 
vraphite in tension. 

Because of the marked anisotropy of the graphite 
crystal? °*° any departure from a random orienta- 
tion, as may result from the forming process, would 
be expected to have an effect on the thermal expan- 
sion in various directions with respect to the direc- 
tion of forming. Specimens for creep testing were 
cut both perpendicular and parallel to the direction 
of extrusion or of molding pressure. Therefore, 
specimens for thermal expansion determinations 
were cut similarly to duplicate the crystal orientation 
of the creep specimens. 

In order that the thermal expansion data might 
be of more general use, expansion values were 
obtained while heating from room temperature to 
i.370° CC. and contraction values were obtained 
during cooling from the maximum temperature 
to 400° C 


II. Materials 


Three grades of artificial graphite were obtained in 
the form of blocks. For each grade, at least two 
specimens were made representing each of three 
orientations with respect to the direction of molding 
pressure or of extrusion. Additional specimens were 
used to check the reproducibility of the testing 
equipment and the uniformity of the block samples. 


project was sponsored by the NEPA division of the Fairchild Engine and 
A e Corp 
Nelson and D. P. Riley, The thermal expansion of graphites from 15°C 
Part I experimental; D. P. Riley, The thermal expansion of graphite 
theoretical, Proc. Phys. Soc. London 57, 477 (1945) 
ies Pierry, Physique etude sur Ia dilattaion du graphite Comp. Rend 
mie Des Sciences, Paris 223, 501 (1946 
r Hidnert, Thermal expansion of artificial graphite and carbon, J. Re- 
\ BS 13, 37 (1934) R P4693. 
r Hidnert and W. T. Sweeney, Thermal expansion of graphite, BS Tech 
1927). 


The specimens were right evlinders machined to ap- 
proximately “in. in diameter by 2 in. in length. 
Figure 1 shows how the specimens were cut from 
the blocks. The cross-sectional area of the blocks 
perpendicular to the direction of extrusion or of 
molding pressure represents the die opening for the 
extruded Grades A and B and the mold size for 
Grade C 


III. Apparatus and Methods of Test 


The dilatometer used in determining the thermal 
expansion of graphite was similar to that used in an 
investigation of zirconia.” The dilatometer con- 
sisted of an alumina base into which three artificial 
sapphire rods were cemented to support the dial 
gage and a fourth sapphire red centrally located to 
provide the mechanical linkage between the dial 
gage and the evlindrical graphite specimen, which 
rested on the alumina base. Dial gages reading 
either to 0.0001 in. or to 0.01 mm were used. Gage 
readings were corrected for the expansion of the 
sapphire. The values for the expansion of the sap- 
phire were taken from a determination from room 
temperature to 1,000° C by the interferometer meth- 
od and extrapolated to 1,400° C (see footnote 6) 
The specimen and the lower part of the dilatom- 
eter were heated in a wire-wound furnace, the 
inner winding of which consisted of platinum 20 
percent rhodium and the outer winding of Chrome! A. 
The furnace and the dilatometer fitted under the bell 
jar of a vacuum coating unit.” The shield clamped 
to the tripod below the dial gage never reached a 
temperature above 70° C during a test. The equip- 
ment is shown in figure 2. 

The measurement of the temperature offered 
greater difficulty experimentally than did determina- 
tions of the change in length. The temperature near 
the specimen in the first 22 tests was measured with a 


*R. F. Gellerand P. J. Yavorsky, Effects of some oxide additions on the thermal 
length changes of zirconia, J. Research N BS 35, 87 (1945) RP 1662 

’ Manufactured by Distillation Products Inc., Rochester, N. Y. 
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30 gage Pt-Pt 10-percent Rh thermocouple in a 
closed-end protection tube extending radially through 
the furnace to the specimen. The protection tube 
was needed to prevent reaction between the graphite 
and the thermocouple metal. This reaction, the 
nature of which was not determined, had been ob- 
served in tests at temperatures above 980° C. In 
later tests, it was found that chromel-alumel thermo- 


couples could be used without a protection tube. 


Although a new base-metal couple was used in 
each test, those removed were bright and could 
probably have been reused since subsequent calibra- 
tion disclosed no deleterious effect. 

Before making a test, the bell jar was evacuated to 
less than 0.05 mm of mercury by mechanical pumps. 
At the beginning of the test the oil diffusion pump 
was started. The vacuum attained by the combina- 
tion of mechanical and oil diffusion pumps was 
usually less than 2107 mm of mereury during the 
test. The heating rate was then adjusted to 3° C 
per minute, and measurements of expansion were 
obtained at 30° C intervals by observing dial read- 
ings through the glass wall of the bell jar. The 
maximum temperature of 1,370° C was held for 45 
min after which the specimen was cooled to 400° C 
at the same rate of 3° C per minute. Air was admitted 
when the specimen had cooled to 400° C to permit 
reaching room temperature by the next morning. 

For tests 1 through 22, for which a closed-end 
protection tube was used, the thermocouple had to be 
calibrated at the prescribed rate of heating to obtain 
true temperature readings. It was found that, 
because the protection tube could not be placed in 
the same position for each test, an error resulted 
in the temperature readings especially at the begin- 
ning of the tests. The effect of this error was reduced 
by averaging of the values. 

The average deviation between expansion values 
determined under similar conditions for the same 





Equipment used for determining thermal expansion 
of graphite. 


Figure 2. 


grade and orientation of graphite specimens was 
found to be +0.02 percent (see table 3). 

A test of the apparatus was made by measuring 
the expansion of a single crystal of periclase (MgQ) 
to 1,370° C in vacuum and comparing the results 
with a previous dilatometer measurement on the 
same specimen heated to 1,700° C in air (see footnote 
6). The comparison is given in table 1. 


TABLE 1. Comparison of the linear thermal expansion data 
obtained on the same MgO specimen in vacuum and in at 


Linear thermal expansion 


Room tem- - 


perature to 


Vacuum * Air ' 


‘ Percent Percent 

100 oOo O06 

200 16 

300 2 0 

400 7 3 

Oo 5 

aoo 

TOO 

SOO 

900 1.17 
. 000 1. 32 
. 100 1. 48 

200 1.4 
, 00 1. SI 

1, 370 1.93 


* No loss in weight was found as a result of heating in a vacuum. 
> Geller and Yavorsky (see footnote 6). No data on cooling were reported by 
these observers. 


Another check of the apparatus was obtained by 
| comparing the dilatometer measurements with those 
obtained by the interferometer method between 
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grade 


The agreement 


temperature and 400 
A graphite is 


ied with specimens of 
in table 2. 


IV. Results and Discussion 


linear thermal expansions and contractions 
ssed in percentage for 100-deg intervals from 
temperature to 1,370° C, and the average 
ients of expansion and contraction per degree 
rade over 200-deg intervals, are given in 
3 for each of the three grades of graphite and 
three directions with respect. to forming 
(one parallel and = two perpendicular). 


he 


ire 


Values are given for at least two specimens repre- 
senting each grade and orientation with respect to 
the direction of forming. Also, tests were repeated 
twice fer specimen X, of grade A and onee for speci- 
men A, of grade B to show the reproducibility 

Moreover, specimens X, and X, of grade A 

taken from the inside of the block (see fig. 1 
compare with specimens A, and A, of grade A from 
the outside face of the block, both pairs of specimens 
cut parallel to the extrusion direction. The average 
of the linear thermal expansion values from table 3 
are shown in figure 3. Average coefficients of ex- 
pansion computed from the curves in figure 3 are 
given for intervals of 200-deg centigrade in table 4. 


were 


lo 


Comparison of expansions of grade A graphite determined by the interferometer and the dilatometer methods 


Orientation Pemperature range 


RT «to 100 
100 to 200 
200 to 300 
300 to 400 
[RT to 400 
RT to 100 
100 toe 200 
200 to 300 
400 to 400 
RT to 400 
RT to 100 
100 to 200 
200 to 300 
{00 to 400 


RT to 400 


to extrusion direction At 


extrusion direction L, M 


ndicular to 


ndicular to extrusion direction U* 





rage of 2 companion specimens 
orientation diagram, fig. 1 
room temperature 


The differences in the expansions perpendicular 
and parallel to the direction of forming, for the three 
vrades of graphite, are attributed to the different 
degree of preferred orientation, and to the marked 
anisotropy of the graphite crystal. The positions 
of the erystals depend on the method of forming. 
With the extruded graphites, the C axes of the hex- 
agonal crystal plates tend to be alined perpendicular 
to the sides of the die, this tendency being a function 
of the size of the grains, the density of the mass, 
the extrusion pressure, the size and shape of the die, 
In the case of the molded graphite, the crystal 
plates tend to be alined with the C axes parallel to the 
sides of the mold. Since the direction of greatest 
expansion of the graphite crystal is parallel to the 
( axis (i. e., perpendicular to the basal cleavage), the 
specamens cut parallel to the direction of molding 
pressure and perpendicular to the extrusion direc- 
tion showed the greater expansion. As is generally 
the case with clay bodies, the erystals are preferen- 
tially oriented to a greater degree by extrusion than 
by molding. This accounts for the fact that in the 
case of graphites A and B the difference in expansion 
between specimens cut parallel and perpendicular 
(to the extrusion direction is greater than the differ- 
ence for comparable specimens of molded graphite C 


ete 


Coetlicients of linear thermal 


eXpansion Linear thermal expansion 


Interferometer Dilatometer Interferometer Dilatometer 


lrercent 


In the case of 


grade B graphite, the two pairs of 
specimens (U, T and Ly, L.), which were cut perpen- 
dicular to each other and to the direction of forming, 
differed significantly from each other in their linear 


thermal expansion, For comparable specimens of 
grades A and C, this difference was not significant. 
The large and rectangular die opening used in ex- 
truding the grade B graphite may have allowed a 
different. degree of orientation of erystals between 
die faces than the relatively small square opening 
used for extruding grade A. This same explanation 
may account for the uniformity of expansion values 
for specimens of grade A cut parallel to the extru- 
sion direction, despite the fact that specimens X, 
and X, were cut from the center of the block. Note 
in table 3 the uniformity of the expansion values 
for various specimens of grade A graphite cut per- 
pendicular and parallel to the extrusion direction 
for tests in which the temperature was measured in 
the same manner, that is, with the protected thermo- 
couple or with the. bare thermocouple. 

The results of the determinations for indicating 
the reproducibility are given in table 5. Comparison 
shows that the values do not deviate materially. 
Note that in test 44 the temperature was maintained 
for at least 1 hour at each of 10 temperatures. This 
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Fictre 3 Average linear thermal expansion of three graphites. 


A graphite: 1, average of four specimens perpendicular to extrusion 
2, average of four specimens parallel to extrusion direction. Grade B 
graphite: 3, average of two specimens, ( and 7, perpendicular to extrusion 
direction, 4, average of two specimens, LZ, and Ly, perpendicular to extrusion 
direction; 5, average of two specimens parallel to extrusion direction; grade C 
graphite: 6, average of four specimens parallel to molding pressure; 7, average 
of four specimens perpendicular to molding pressure 


(irack 
direction 


was done to evaluate the lag in temperature meas- 
urements made with the bare  chromel-alumel 
thermocouple. The expansion measurements show 
that above 600° C the values during the regular test 
(43) were slightly larger in every case, indicating a 
temperature measurement lag. However, the dif- 
ference does not materially affeet the expansion 
values. 

No attempt was made to compare the thermal 
expansion determinations of the three graphites 
investigated in this work with determinations of 
other graphites because of the wide variation of 
values found in the literature. The variation is 
attributed to the anisotropic characteristic of the 
erystal and the structural nature of the mass. The 
structure of a graphite body is unlike that of a 
metal or a porcelain body. The flat graphite 
crystals do not form impermeable solids, but rather a 


low density porous body. The many voids decreas 
the total thermal expansion considerably. Th 
total effect would depend on the grain size and por 
distribution. 

The work of Nelson and Riley (see footnote 
and Pierry (see footnote 3) indicates the degree « 
anisotropy of the graphite crystal. Most of th 
thermal expansion of the crystal is due to the ey 
pansion in the “C” direction. Along the “C 
axis, the former observers (see footnote 2) found tha 


the coefficient of expansion was about 29> 10>°, an 
along the “A” axis only 0.9% 107-° between 600 


and 800° ©, 


Pance 5. Linear thermal er pansion data obtained from re pee 


tests® of two grade s of graphite 


(irade A specimen X (Cirade B specimen A 


Pemperature range 


Test 44 Test 2s Pest 40 


Pest 43 
Linear thermal expansion 


c Percent 
RT te 100 ow 
RT to 200 ol 
RT to 300 Os 
RT to 400 On 
RT to 500 oT 
RT to 600 Ww 
RT to 700 13 
RT to 800 i 
KT to #00 14 
KT to 1,000 22 
KT to 1,100 25 
RT to 1,200 2s 
RT to 1.300 {2 
RT to 1,370 uh 


RT to 20 
200) to 400 
100 to 600 
100 to 800 
SOO to 1.000 
1.000 te 1.200 
1.200 to 1.370 


* Tests 43 and 44 are first and second heatings, and tests 28 and 40 are second and 
third heatings 

> Test 18 was not included because of the expected difference bet ween expansio 
values due to the different method of measuring temperature 


¢ RT =room temperature 


V. Summary 


1. The linear thermal expansion of specimens of 
three grades of graphite, cut in three directions with 
respect to the direction of extrusion or of molding 
pressure, was determined at various temperatures 
between 20° and 1,370° C in vacuum. 

2. Specimens cut perpendicular to the extrusion 
direction or parallel to the direction of molding had 
greater expansions than specimens cut parallel to the 
extrusion direction or perpendicular to the molding 
pressure. 

3. It was found that chromel-alumel thermocouples 
could be used successfully in vacuum at temperatures 
well above those at which they fail in air. 


WaAsHINGTON, September 13, 1950, 
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Research Paper 


The Absolute Standardization of Radioisotopes 
by 4. Counting 


H. H. Seliger and L. Cavallo 


The absolute counting of beta particles, using 4 * proportional flow counters, is described. 
This method of absolute counting has been adopted in the Radioactivity Section of the Na- 
tional Bureau of Standards for the primary calibration of radioisotope solution standards 
Pillbox, eylindrieal, and spherical 4 * flow counters have been used in the measurements, and 


excellent results have been obtained for each. 


\ simple method of souree mounting has been 


developed, and a formula is derived that determines the absorption in the source mounting 
regardless of knowledge of the thickness of the mounting or energy spectrum of the beta 


rays. 


Results are given of intereomparisons of P® and I! with other laboratories in the 


United States, Canada, and Great Britain that employ different types of absolute counting 


techniques. 


The applicability of the 4 * counting method to the standardization of practi- 


cally all beta emitting radiosotopes is discussed. 


I. Introduction 


} « proportional flow counters have been used at 
the National Bureau of Standards for the determina- 
tion of absolute disintegration rates of radioisotopes 
since September 1949.'— The principle of 2 # and 4 x 
counting is well known to workers in the field of 
radioactivity. However, several improvements have 
been made in the measurement technique, and since 
the method has not been covered previously in any 
detail it was felt that a description of the technique 
used would be of practical interest. This is partieu- 
larly true in view of the excellent agreement that has 


been obtained with other published absolute counting 


methods.? * # 


II. Technique 


To eliminate the difficulties associated with Geiger 
counters and still retain somewhat their pulse limit- 
ing property, the 4 x counter is operated in the region 
of limited proportionality.®* Figure 1 shows the 
disassembled evlindrical and pillbox 4  counters.* * 
The insulators are of Teflon. The stainless steel 
evlindrical counter was designed with the length 
equal to the diameter, so that the minimum path 
length for a particle emitted in any direction from 
the center of the counter would be as close as possible 
to the average path length. A 0.001-in. conducting 
diaphragm, also visible in figure 1, fits between the 
two halves and serves to divide the cy linder into two 


ful acknowledgement is made to C. J. Borkowski of the ORNL, who 
ted this method at a meeting of the Subcommittee on Beta and Gamma 
surements and Standards of the Committee on Nuclear Science of the 
nal Research Council in June 1949 
secount of the theory and experimental application is given by L. H 
British J. Radiology 22, 477 (1949 
herical collecting geometry is described by R. K 
ty, Rad. Res. Lab., N. Y. O.—1506 (March 15, 1950 
illel plate collecting geometry is described by W. Gross and G 
79, 209 (1950 
Novey, Rev. Sei. Instr. 21, 280 (1950 
Korff, Electrons and nuclear counters (D. Van Nostrand Co., 1946 
and Staub, lonization chambers and counters (MeGraw-Hill, 1949 
and Staub, lonization chambers and counters, p. 130 (McGraw-Hill, 
ustrate a 4 # counter used for @ particles 
ram of a 4 x counter is shown in a review article on beta counting of 
nam, United Kingdom, AERE-N R-318 (Feb. 24, 1949 


Clark, Columbia 


Failla, 


YSSS51-—-51 


equal hemicylinders so that the top and bottom 
halves function independertly. The source ts 
mounted on the polystyrene face of a laminated film 
of Formvar and polystyrene approximately 40 
wg/em in thickness, and this thin film is cemented 
over the diaphragm center hole. 

99 Mole percent of methane has been used as a 
flow gas in all counters and has been found to be 
preferable to an Argon-Methane flow mixture. Both 
the Bell Jordan linear amplifier "' and the Brookhaven 
nonoverloading amplifier? have been used 

Two comparison tests are made on all readings 
taken with the 4 # counter. First integral discrim- 
ination curves are plotted for various electronic gains, 
keeping the counter voltage constant, and second, 
these discrimination curves are plotted for various 
counter voltages or gas gains, keeping the electronic 
gain constant. In both cases if the discrimination 
curves are flat, one can assume that all primary beta 
particles are being detected, 

The curves shown in figures 2 and 3 illustrate the 
completeness of the electron collection and also the 
method of extrapolation to find the true disintegra- 
tion rate. Even though the discrimination curves 
have been found in practice to be flat, it can be as- 
sumed that some primary beta particles are not de- 
tected, their number being too small to contribute 
appreciably to the slope of the curve. The gas am- 
plification in this region is 10’, and the electronic gain 
is 5000. The capacity of the counter circuit is ap- 
proximately 10°" farad. Therefore a primary beta 
ray must create in the neighborhood of 12 ion pairs 
in order for its pulse to be detected, assuming that 
an output pulse must be at least 10 volts in height 
This would preclude from measurement all primary 
beta ravs with energies less than 400 ev © which ts 
a negligible fraction in the cases considered. 


L. M. Fry and R 
14s 

P. R. Bell and W. H. Jordan, Rev. Sei 

2W. Bernstein, H. G. Brewer, Jr. and W 

This agrees roughly with the calculations of RK 
munication 


lr. Overman, Atomic Energy Com. Doc Ine Jan. 30 
Instr. 98, 703 (1947 

Rubinsin, Nucleonies 6, 39 (1950 
Ballentine, private com 





Ficure 1 Disassembled cylindrical and pillbor 4 * counters. 


In the foreground are the 
he relative 


The “0” ring Teflon insulators are clearly visible 
diaphragms and the 0.001 in. aluminum source mounting ring 
seale is given by the feentimeter rule] in the feenter 


III. Source Mounting 


Experiments have been conducted to determine 
the amount of absorption, if any, due to the film 
between the source and the lower half of the 42 
counter. The number of particles counted by the 
top half of the counter connected separately will be 


Bb l. (1) 


where .V, is the true disintegration rate of the source, 
By is the percentage backscattering from the film, 
r is the fractional absorption in the film, and P,,(6) 
is the percentage backscattering due to the walls in 
the bottom half. The number of particles counted 
by the bottom half connected separately will be 


N 


Noeettom \(1 


From symmetry considerations By (tf) By (b) = Ry. 
The factor Pe can be neglected when the film is 
thin and of low atomic number, so that (1) becomes 
: No 
Nie== (I 


In the 
obtains 


bottom half, again assuming /’, 


{a T 


+ By} 


Putting this in the form y=<ar+ 6 gives 


: No N 
N bottom ») ibs 2 » 


- (i By : (5) 


With the thin films under consideration it can be 
assumed that the absorption is directly proportional 
to the film thickness. Equation (5) can be used to 
determine the absorption correction graphically. A 
more direct method of determining 7 can be deduced 
from eq (3) and (4). 


























5 
DISCRIMINATION LEVEL 


Ficure 2. Counting rate as a function of discrimination 


for increasing steps in amplifier gain. 


The source is P™ in the pillbox 4 # counter The voltage on the counter 
held constant during all runs. Gain 2 is twice that of gain 1, and gain 
times that of gain 1 


N top N bottom 


The actual counting rate observed with 
bottom halves connected together Is 


Na No | l 7/2) 
Nottom 


so that (6) becomes, if one lets Nw 
Niw } A 
Nu Neetvem . les N, 
and similarly 


By Nin 


The absolute counting rate is then obtained by sub- 
stituting (8) into 


Thus by taking three different readings of the same 


source on a single film it is possible to determine the 


absorption by the film. This proves extremely 
useful for low energy beta particles. 
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». Counting rate as a function of discrimination level 
for increasing steps in voltage. 


1 in the evlindrieal 4 # counter 
iiserimination curves 


The electronic gain was held 
were run for various voltages on the counter 


Experiments have been performed with P®, [™ 
and CO” using film thickness in multiples of approxi- 
mately 40 ge em?®, and excellent agreement has been 
obtained between eq (5) and (9). As an example 
of the corrections necessary for absorption, for a 
film of 40 pe/em? thickness 7—0 for P*, 70.01 for 
and r—0.05 for Co In the cases of 1!, Co” 
and ©’ polystyrene films as low as 10 ywg/em* in 
thickness have been used. More complete data 
are now being taken for Co®, C', and Na®. 

Previously when thin films were used as source 
mounts it was thought necessary to sputter a thin 
conducting laver of gold on both sides of the film so 
as not to disburb the collecting fields. However, if 
the area of the dielectric film is made small compared 
to the total area of the conducting diaphragm these 
collecting fields will be perturbed only slightly with 
no loss in efficiency. All results shown for the evlin- 
der and spherical counters have been obtained using 
nonconducting plastic films, and table 1 shows the 
agreement for P® between the cylinder counter in 
Which such thin films were used as source mountings 
and the pillbox counter in which the sources were 
mounted on a conducting film of aluminum. 


IV. Source Thickness 


Equation (10) previously derived is valid, provided 


there is no self-absorption in the source itself. In 


any case the effect of self-absorption in a source of 


the same thickness as the mounting film is not as 
serious as the absorption in the film. The source is 
an evaporation covering a small area, and therefore 
there is a slowing down of electrons rather than com- 
plete stopping, as in the case of the film where some 
of the emitted electrons can have a long path length 
Since the specific ionization curve has a maximum at 
approximately 7,000 ev, this slowing down usually 
adds to the total ionization. The effeet of self- 
absorption will be more important in the cases of 
Co”, C™, and S*, 


V. Results and Conclusions 


P* and I! have been calibrated in all 
three types of 4 zcounters. The internal consisteney 
of repeated measurements in all cases is within | 
percent. The results of some determinations made 
using the three types of counters are shown in table 1. 

It is felt that the cylindrical 42 counter has 
slightly better collecting fields than either the sphere 
or the pillbox. This does not affect in any way the 
absolute counting of the beta particles, but is evi- 
denced by a slightly longer voltage plateau for the 
evlinder. However, the spherical counter is by far 
the most economical, the total cost including modifi- 
cations and mounting stand being approximately 
$80.00." 

The evlindrical 427 counter is at 
used for all standardizations 


SOUrces 


present being 

The results of a series of intercomparisons of P” 
and ['' solutions originally calibrated at the National 
Bureau of Standards and distributed to other labo- 
ratories are given in table 2. 

The technique described above is applicable to any 
beta emitting radioisotope, regardless of the com- 
plexity of the 


states or gaseous 


lsomeric 
difficulties 
However all effects due to gamma-ray line spectra, 
gammia-ra\ 


disintegration scheme. 


Isotopes Tay cause 


conversion electrons, or electrons pro- 
duced in the gas or the walls of the counter by gamma 
ravs only add to the total ionization per disintegra- 
A high 
specific activity source is not required, since such 
low counting rates ean be recorded that water dilution 
is practically always possible. Most important, the 


method is self-consistent. Any improper mounting 


tion and are counted as one ionization pulse 


or positioning of the souree is shown up immediately 
by a large value for 7. Any other malfunctioning 
will not result in flat discrimination curves. There 
are no further meter or condenser calibrations neces- 
sary. Equation (10) gives the true disintegration 
rate regardless of changes in temperature, humidity, 
pressure inside the chamber, or voltage 

The 4 x counter method is at present being used to 
calibrate sources of C’, Na”, and Co As soon as 
practicable S*, Na®*, Au", and Sr® will be calibrated. 


oo 


‘ Hemispheres are available f 
Ave., Indianapolis, Ind 
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Tare | Results of comparison measurements made of P®™ and 1" in all three types of 4x counters 


[The errors indicated are statistical standard deviations} 


Isotope and date of distribution Counting rate lype of counter Type of amplifier Resul y 


Counts dps/mix 
j Less than 100 Pillbox Bell Jordan 1+? 
| do Cylinder do 208+ 1 
do do do 29+0.08 
| do do Brookhaven 241+0.40 
do Sphere Bell Jordan 240+0.24 
Approximately 900 Cylinder Brookhaven Z0+1.0 


Tarie 2. Results of independent intercomparisons of P™ and 1 made by other researchers using absolute counting techniq 


lhe errors, except for Dr. Hawkings’ figures, are estimated inaccuracies in absolute value 


Activity dps/ml x10 


Isotope and date of distribution 
H. Gray ioniza- R.C. Hawkins 4 « Oak Ridge defined G. Failla par 


NBS unter 
BS 4x counte tion chamber counter * solid angie plate chamt 


is 4w ws. 5 248+ 

227/00 i274 241.4 

ws0 y+ w4t1.7 

wise 2 7 r20+ 5 24541 Mt5 4147 


kK. C. Hawkings, Can. Nat. Research Coun., Chalk River, Canada, uses a spherical 4 * counter similar to the NBS counter for his determinations 


The authors are indebted to R. C. Hawkings for | given to L. Costrell of the Nucleonies Instrumen- . 

his cooperation in the intercomparisons of the P™ | tation Section for the construction of the eleetrony 

and 1"! solution standards, and to L. H. Gray, | equipment, for his valuable assistance in setting up 

Hammersmith Hospital, London, Eng., G. Failla, | the pillbox 4 # counter in September of 1949, and for 

Columbia University School! of Medicine, E. 1. Wyatt | his continued interest. 

and J. A. Cox of the Oak Ridge National Laboratory a 

for their interest and cooperation, Thanks is also Wasuincron, December 18, 1950, ’ 
Su 
obt 
ver 
Ist! 
why 
Ma 
to | 


In 
iter 
and 
roo! 
arb) 
be 
wis! 
mal 
iF 
how 
the 
vere 





{ Research of the National Burea 


Research Paper 2227 


A Method of Gradients for the Calculation of the 
Vectors of a Real Symmetric 


Characteristic Roots and 
Matrix’ 


Magnus R. Hestenes and William Karush 


Let A be areal symmetric matrix, u(z). 
and £(r) the gradient vector of u(z). 
of the operation 7=2—a(z) &(z). 


choices for a(x), and the rate of convergence is 


The Rayleigh quotient formed with a vector + 
The method of gradients consists in an infinite iteration 
The convergence of the procedure is proved for several 


studied extensively for one particular a(r 


The directions of the vectors in the sequence are seen to tend to that of the characteristic 


vector belonging to the lowest characteristic val 
determination of all characteristic vectors and v 


I. Introduction 
With a real symmetric matrix 


A=(4a,,;), €ij=6y:, 3 


is associated the Rayleigh quotient 


(Ar, zr) 


(r,x)’ 29, 


u(r) 


whose critical points y are the characteristic vectors 
of A. The gradient of » has the direction of * 

E(r)= Ar—ypr. (1) 
Suppose now, given a vector 7, we wish to modify z to 
obtain a better approximation # to a characteristic 
vector You Corresponding to the minimum character- 
istic root Ammo min pr). It is natural to form 


aég, 


0, (2) 


Tr a v4 
where a may depend upon sz. Similarly, to approxi- 
mate to a characteristic vector Yo, Corresponding 
to X, max u(r) we form 


aX 


zr=2r+a, a>. 

In the present paper we describe several convergent 
iterative methods based upon this gradient process, 
and investigate the convergence to the characteristic 
roots and vectors of A. The results apply to an 
arbitrary, real symmetric matrix. The methods can 
be phrased to yield directly ymin OF Ymax, aS One 
wishes. For convenience we direct our attention 
mainly to Ymin- 

In prescribing a gradient method one must specify 
how the number a is to be chosen at each stage r' of 
the iteration. It is shown that the vectors sr‘ con- 
verge to the appropriate characteristic vector if @ is 





a 





, 'he preparation of this paper was sponsored (in part) by the Office of Naval 


mvenience we shall refer to £ as the gradient 
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ue. The method can be used for a numerical 
alues. 


any positive constant (independent of 7) less than 
2/M, M=Xwax— mm. The bulk of the theory is con- 
cerned with this “method of fixed a.” If we impose 
the stricter requirement a<1/.\/, we obtain in addi- 
tion that the gradients &' converge in direction (i. e., 
the unit vectors £'/ &° converge) to a characteristic 
vector; in fact the method will be generalized to yield 
all of the characteristic vectors of A. As would be 
expected, the nature of the convergence is essentially 
geometric. 

A second “method of optimum a” is treated in 
which a, which now depends upon s, is selected in a 
certain “best”’ way. In this method the approxima- 
tions s' converge to a characteristic vector, but the 
gradients £' fail to converge in direction. 

The well-known method * of forming powers .1‘r 
can be interpreted as a gradient method in which a@ is 
chosen as —1/u. Here convergence, in general, is to a 
characteristic vector corresponding to the character- 
istic root of maximum absolute value. We remark 
that, commonly, the gradients ‘ converge in direction 
to a characteristic vector corresponding to a root of 
next highest absolute value. 

The chief virtue of the gradient methods seems to 
lie in their simplicity. They are not put forth as 
rapid procedures for a hand computer, but rather as 
processes that might be adapted to automatic com- 
puting machinery. A survey of methods for calcu- 
‘ating characteristic roots and vectors of (more 
general) matrices may be found in Hammersley.’ 


II. Properties of Symmetric Matrices 


In this section we collect for reference some well- 
known facts on symmetric matrices. 


) An extension of the method of optimum a@ to more general problems has been 
outlined by L. V. Kantorovitech, On an effective method of solving extremal 
problems for quadratic functionals, C. R. (Doklady) Acad, Sci., URSS (N. 8.), 
48, 455-460 (1945 These results are closely related to some unpublished work of 
M. R. Hestenes 

4 See H. Hotelling, Analysis of a compk 
components, J. Educ. Psych. 24, 417-441 
Hotelling treats the symmetric matrix 
matrices, see A.C. Aitkin, Studies in practical mathematics 
of the latent roots and latent vectors of a matrix, Proe. Roy 
57, 269-304 (1937 

The numerical reduction of non-singular matrix pencils, Phil. Mag. 4@, 783-807 
(1949). 


x of statistical variables into principal 

and 498-520 (1983 In this paper, 
For the extension to nonsymmetrie 
Il. The evaluation 
Soc. Edinburgh [A] 





Consider, for the moment, the space of complex 
vectors £ (b,, b., . ., b,) over the field of complex 
numbers. A number A is called a characteristic root 
(number, value) of an arbitrarv matrix 72 in case 
there exists a nonnull vector « such that 


Ba Az. 


The vector « is called a characteristic vector: we shall 
suv that it belongs to the characteristic root A. 

For a real symmetric matrix A the characteristic 
roots are real and the characteristic vectors can be 
chosen to be real. Accordingly we henceforth limit 
ourselves to the field of teal numbers and to real 
vectors We let / denote the space of real vectors 
rv. We use “(2, y)” for inner product, and“ 2” for 
length in this space. We note a fundamental 
property of AA 


(sla, y r, aly 
Let the distinet characteristic roots of «1 be de- 


noted by 


(eliminating the trivial case p= 1 With each A, is 
associated the linear subspace . / of 4 which is the 
set of all characteristic vectors belonging to Aas. 
tovether with the null veetor. The dimension of 
#4. is the order of A,. The subspaces . 4% are 
mutually orthogonal and span the space °/ 
To each nonnull vector z we attach the number 


‘ ly, 


(x.2) 
The function » is homogeneous of degree zero; 
u(bs) u(r), 4x0. 


For a characteristic vector y belonging to a charac- 
teristic root A, 
uly) 


We have the following well-known relations between 
pw and the characteristic roots. 


A,;=min p(s), A,=max u(z), r#0 
z z 
or, equivalently, 
A,=min p(s), A, 
z 


max u(r), 
z 


More generally, 


A,=ming(r), x #0andorthogonalto. 4,...,. 4 1; 


Ay=maxy(z), x2#O0andorthogonalto. 4,,,...,.4. 


To study the behavior of subspaces of .-/ under 
the matrix A it is convenient to think of A as a 


linear operator, without regard for coordinate pr jpe- 
sentation. In general, if B is a symmetric |) (oy, 
operator on a finite dimensional space .2, then {hp 
statements of the preceding three paragraphs 
(by “symmetry” is meant the property of 
Now suppose that .4 is an m-dimensional subs 
of «/ and let B be the operator A with do: 
restricted to 4 If 4 is invariant under A, j 
r belonging to 4 implies Ar belongs to 4, th 
is a symmetric linear operator on 4 As sue! 
has m characteristic roots (counting multiplicitie 
and the reots and vectors of 72 are roots and ve 
of AA. 

The characteristic roots of the matrix «A ar 
solutions of the polynomial equation 


AJ — A\=0. 


The multiplicity of a solution A, is precisely the dimen- 
sion of the subspace . A, i. e., the order of Ay Wi 
remark that if A is regarded as a linear operator 
then this equation, formed with any matrix represen 
tation of “A, \ ields the characteristic roots. 

In our work we shall be dealing always with a 
fixed initial vector 2°. Let 2° have nonnull proje: 
tions in .%,...,.%, and only in these subspaces 
4/.. We may represent 2° uniquely in the form 
QUYyite ++ +ayy, 


» 


ay >o, ly . e.Fe, §=1,2,--: 


(We exclude the triv ial case 7 * Let Wr be the 
dimensional subspace spanned by y,, .... y-: 


4 Yr Yor ° * * 5 Yr)- 


Then <% is an invariant subspace under A with 
orthonormal basis (y;, yo, -..,y). The character- 
istic roots of A relative to this invariant subspace 
(i. e. the reots of the linear operator A with domain 
restricted to /) are 

wy 
where 

N=Ae. fj 

In the subspace each root A, has order one, and has 
as corresponding characteristic vector, y,;. In the 
following pages, after having selected an_ initial 
vector 2°, and thus determined the invariant sub- 
space A, we shall be dealing exclusively with vectors 
in this subspace. 


III. Determination of « 


This section is discursive; the theorems of the 
following sections do not logically depend upon it 
A direct calculation shows that the gradient of u Is 


9 
ts, 3) [Az—p(e)z}, 
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ence has the direction of &, 
marked, we shall refer to 
ve the relations 


1). 
gradient, 


given by eq 
the 


c 


& as 


Ar,é 
‘gradient is the direction in which uw locally 
ses most rapidly. Thus if we form 7 from 4 
eq (2), with a >0 sufficiently small, we should 
u(F) to approximate A, more closely than 
Beginning now with 7 and, say, @, we obtain 
eq (2) a next approximation 7, ete. The 
on then is to specify @ systematically. 
direct computation we find 


flat, 


itural requirement on a(s 
ergence of uw to A, Is then 


order to expect 


0 


\ possible choice is a= const. with 


is the spread of the characteristic roots of A. Since 
V/ it is easily seen that tla) >0. The method 
of fixed a, stemming from this observation, is treated 
inseetions 5 and 6 and generalized in sections 10 
and FP 
(nother possibility for @ is that number 
which maximizes f(a). From f’(y)—0 we obtain 


y(r) 


t*y° + sy 0, 


from whieh 


Computation shows that f(y)=<y. Hence 


u(r)— p(T) = yt’, 


The method of optimum a, based on this approach, is 
treated in section 7. We note the following formula: 


l 


Y (11) 


u(r) 


£) 


a 
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To verify this write eq (10 y (yt? . 
and note that the quantity in parentheses equals 
M\S)— pt). 

Remarks on the power method, i. e., forming sue- 
cessive iterates ot, will be made in seetion S. In 
that section and in the later section 12 modifications 
of the methods of fixed and optimum will 
suggested. 

Before proceeding to any specific gradient method 
we formulate some general results in the next seetion 


IV. General Gradient Method 


we 


S 


cr be 


y such 
not a 
r“O0 and Ex 


We suppose given a real-valued funetion 
that ¥ is defined whenever «#0 and 
characteristic veetor, i. e., whenever 
We require that 


ir) r 


Is 


see, € 
eq 10) 
form 


v., the particular funetion y determined by 
Beginning with a vector 2’, expressed in the 
we construct the of 


fe a .. necording to 


sequence Vectors 


i 


¢ 
Ss 


. Where § 


t 
This is equivalent to 


yields 


3 where vi 

We must assure ourselves that av is well defined 
We note first that if ~Qand & 40, then su 
defined and not null. For, under the hypotheses, 
is defined and by eq (12), 


J Is 


t 


¢ 
s 


We thereby distinguish Iwo ivpes of SeqUELICeS, 
The first. a trivial that for some 
first integer i, 


case. is one such 


We terminate the induction at /— &; the sequence 
ro, tO, 1,++-, kis finite with the last vector s* 
a characteristic vector. The second type is charae- 
terized as a sequence that is not of the first type. 
In this case and are well-defined infinite 
sequences and 


tt 
s 


I 


0, 

To enable us to formulate statements that hold 
simultaneously for finite and infinite sequences we 
make the following agreements: (1) if p' is an 
arbitrary finite sequence, then lim p' denotes the 


Is 


last member of the sequence; (2) any statement or 
formula involving an index is to be limited 
meaningful values of the index. 

From eq (7), 


to 


pt—p'?! = f(y')t* >0, t! 





Since uw’ is bounded from below, it follows that 
there is a number v such that 


lim yu ,, (15) 


The vectors ' and & lie within the invariant sub- 
space «A—(y). Me. +++. y-)* their expansions in terms 
of the characteristic vectors y, are readily deter- 
mined from eq (1) and (12). We find 


AY, + ALYo + + + + aly, 
ae ' Ay) a 
A, a 


and 
ways 4 
Thus 


4 gi? 
n'a, Ls o ot (R. p')at. 

Theorem 4.1. Suppose y' is such that 

O< Ky, < fly’) and 0< yy‘, 


for some constant A,. Then 


° x . , 
lim Ties and lim p'=d). 


i 
We limit ourselves to the case when the sequence 
proof below applies as well to the finite sequence. 


From eq (14) 
l 


K, ‘* 


tie< t— yit), (19) 


Since u'r, it follows that 


Let 


Then by eq (18) 


Consequently 


lim (x, — py")?! 


Since (A;—w')-*(A,— ») it follows from the last equa- 
tion of (20) that for some specific value / of the 
index J, 


wid, «bj 1, bt-0 for 7 #1. 


We wish to show that / Suppose / 


From the monotonicity of uw’, we have x! 


From eq (16) 


oe 1+y(u'—A,)) (aj > ay 


: A;)| |at|~ \aj 


a," b+y"(u 


Hence 
a’ a’ ; 
'>,-.'>0 for i=0,1,2.--- 
ai|=la‘ 


This contradiction shows that / We now have 


ay 
' 


t i 


>r;, 


a,>a\>o0. 


The theorem now follows from the first ‘equation of 


| (16). 


Theorem 4.2. If, in addition to the hypotheses 
of theorem 4.1, the sequence y' satisfies the condi- 


| tion 


V<sk,, RK; 


const., 


then the conclusion of that theorem may be strength- 


mito Lhe se | ened to 
vr’) is infinite; by appropriate simplification the | 


lim s'=Ly, 
ina 


where L is a positive constant. 
By theorem 4.1 it is sufficient to show that rs 
From eq (12), 


i 


eet 2/7/21) + (yt) [1 +-(7't')4] 2. © (14+-07't)?]. 


It is well known that the product on the right con- 


, it . . 
verges if the series >) (y‘t')? does. From eq (19) the 
0 


series St' converges, and hence the first series does, by 
| the boundedness of y‘. This completes the proof. 


V. The Method of Fixed a 


In this gradient method we choose y(r) to be a 


| constant @ satisfying eq (8); thus, 


a f 0< B<2. 


It follows that 
x(2— B) 


‘(a)>* —y 
J We 1 +a‘t’ 





wl 


fc Tr 


tha 
tha 


1 ( 1) we derive 
t<2A gr 


\ is the bound of the matrix .1. Hence ¢ is 
| from above. Thus f(a) is bounded from 
y a positive constant. Theorem 4.2 is now 


le. 


em 5.1. 
ng eq (21). 


Let y' be chosen as a constant a@ 


Then 
lim s'=Ly,, 


lim p'=A), 


er use we note that 


Fo 


r' 
(22) 


i >I, 
a\ 


whieh was established in the proof of theorem 4.1. 


We propose to show now that under a strengthen- 
ing of condition (21) the gradients £ converge in 
direction to the second characteristic vector yp. 


The new condition is 


(23) 





Lemma 5.1. Under the condition (23) the sequence 
rs infinite and 


0, 0, 


From eq (16) 


a’ al! A, a 
By eq (23), the expression in braces is positive. Hence, 
from a’ >O, we have a >0. From the second equa- 
tion (18) and the assumption r>2, it follows that 
& #0. The last inequality assures that (7) is in- 
finite 

We introduce the following notation 


A,— Ay 


al d; aN 1) l news 8 \ 1 ? j 


Thus, under the condition (23), 


0. 6,<. 6,21 os a 6o< 6, 1. 


Lemma 5.2. Under the condition (23), 


i+! 


lim —— 
1 


ino d,; 


65 


- €, 
lim —=0, 
i+o a; 


for k. 


The first equation follows from (24) and the fact | 
that ud). To prove the second equation notice | 
that for j<k, 


a) b, 
~ and 0. 


a 


Henee, for any ¢ >0 there is a constant A’ and index 


i, such that 


a, é. . 
K(- +e); 


a 


Choose ¢ so that the number in parentheses is less 
than 1. This completes the proof. 
Lemma 5.3 Under the condition (23), 


. (p'—A,et 
lim * nl == (he Ay 
- a 


lim 


i 
From the orthogonality relation (r', £)=0 and 
eq (16) and (17) we obtain 


(A;—p')ay +(Ag—p')ay 4 +- (A, — pai =O. 


Divide by ai and take the limit as j->©. The 
first conclusion of the lemma then follows from 
lemma 5.2. Using this result we establish the second 
conclusion in a similar way by means of eq (18). 

Theorem 5.2. Let the constant be chosen 
satisfy (23). Then 


a to 


i--+ 


lim y(£"')= Ag. 


Divide both sides of eq (17) by a... Using lemmas 


5.2 and 5.3 we obtain 


vD.« 


lim > A) Yo. 
io & 


It follows that 


tte te 


Also, 


as desired. 
) *M(Y2) = Ao. 


This completes the proof. 

We remark that for the proofs of theorems 5.1 
and 5.2 the conditions (21) and (23) could have 
been relaxed by using A,—A, in place of \f/=—A,—A). 

We conclude this section with a reformulation of 
theorems 5.1 and 5.2 for convergence to the two 
highest characteristic vectors y,, y,—). 

Theorem 5.3. Let the constant @ satisfy eq (21). 
Define the sequence {z'} by the recursion formula 


=27°+ at. 
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on 
Then 


lim z'=Ly,, limy(r')=r,, L> 


If, further, @ satisfies eq (23,) then 


lim lim u(£") 
i--e i+ 

This result is obtained by replacing A by B=—A 
in theorems 5.1 and 5.2, and noticing that the char- 
acteristic roots become 


x r Tee ¥ 


with corresponding characteristic vectors y,, ¥,.;, 
. yy. 


VI. Rate of Convergence for Fixed a 


In this section we investigate the rate of conver- 
gence of the sequences{z‘}, | £\and related sequences; 
in the rest of this section we assume eq (23) holds. 
For convenience we introduce the so-called “ratio” 
of a sequence of real numbers as a measure of speed 
of convergence to 0, and develop some elementary 
properties of sequences with ratios. This notion 
will also be useful in our later generalization of the 
method of fixed a. 

Let us agree that a sequence {6‘) of real numbers 
will be called positive (negative) if 6°>0(<0) for 
i>iy, iy fixed. We shall understand that the se- 
quence is monotonic in case it is monotonic for i> 7». 

Definition. A sequence {6'} of real numbers will 
be said to have the ratio « in case 


r=" with «>0. 


Necessarily a sequence with a ratio is either positive 
or negative. 

The reason for this definition lies in the next 
lemma, which is essentially a rephrasing of Lemma 
5.2. 

Lemma 6.1. The sequences {a‘}, {aj/a} G,k=1, 

- +, r) have ratios 6,, &/6,, respectively. 

Sequences with ratios resemble geometric progres- 
sions. If {6°} has ratio «, then for an arbitrary 
e>0O, with ex, there are numbers 7,>0, 
such that 


Ti\(x—e)'< b' < Tok +e)’. 


Thus, if «<1, then 6* tends to zero more rapidly, 
eventually, than a geometric progression with ratio 
«+e, and more slowly than the progression with 
ratio x—e. Accordingly the ratio of a sequence is a 
measure of the speed of convergence to zero. 
shall express our results on rate of convergence in 
these terms. Notice that when «<1 the sequence {b*} 
is monotonic; decreasing if the sequence is positive, 
increasing if negative. 


We | 


T.>0 
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Suppose {6‘}, fe} have ratios «, x respectiy oly 
Then clearly {b'e'} has ratio «x. and {b‘/e*} has ; \tio 
x /k. A convergent sequence {d‘} with a non -ere 
limit has ratio 1. Consequently each of the seque ees 
{bid"}, {b/d*} has ratio «. The following lemma wil) 
be used frequently. 

Lemma 6.2. Let {b'} have ratio x. Suppose 
is a sequence such that d'/b' has a nonzero Jini 
Then {d‘} has ratio x. 

For, d‘/b* has ratio 1, and the product d' 
has ratio x. 

If «#1, then the sequence of differences b4—**' also 
has ratio «x. For this we apply the last lemma with 
dt*—bi—b'*", 


Lemma 6.3 


(dub \} 


Suppose {b'} has ratio 61. Let 


B SD by. 


i ;! 
lim ji 
In particular, |‘; has ratio 64. 

Clearly we may assume that | 4‘) is positive. Se- 
lect an arbitrary « with 0<e< 6, 5+e<1. Then for 
sufficiently large, (6—«)b'<b'*'<(6+ €)b', (6 
b'*?<(6+-6)*b', and in general 


e)*b < 


(6—e)/b'<b't4<(5+6)/b', 
Hence, summing, 


l l 
<3-< 
1—é6+e-6b' -1—é—e 


Bo B l 
bi = lim h < . 


<lit 
o— *~1—é—e 


1—6s4 


Since ¢ is arbitrary, the desired conclusion follows. 
Lemma 6.4. Suppose | 5‘) is a positive sequence 
with ratio 6<1. Let 


Qi= (1+ 5%). 
j=i 
Then 
1 1 


lim ee Pe 


i-@ 


The existence of the infinite product Q‘ follows 
from the fact that the series >) 6’ of positive terms 


j=1 
converges. Now 


i+l 
(1 +64=14+(6'+6't'+ ... +6'* 


+(b'b*t'+ 2. +... +(OtOtt" . . . FF". 


When i is sufficiently large for 6’, 7 >i, to be positive, 





~~ —_— & oe 


i+l i4+l 
>> Os 1 +64)—1< B+ (BYP + 
i i 


ing | ©, we obtain 
— 
B <¢- 


ling by 6‘, letting +>, and using lemma 6.3 
tain the desired result. 
are now ready to state the main results of the 
mn 
corem 6.1. Let eq (23) hold. 
ve sequence with ratio 6. 
his result is an immediate consequence of lemmas 
6.1, and 6.2. 

From this theorem it follows that u‘—y' 
sequence with ratio &. Likewise, ¢® is a sequence 
with the same ratio, as we see from lemma 5.3. 

Lemma The positive sequence L—a, 
ratlo 09. 

The number ZL is that of theorem 5.1. 

L. From eq (16), 


Then w'—A, isa 


‘lis also a 


6.5. has 


We have 


+ o(u*— X,) 


Since »«*—X, has ratio 63, the conclusion follows from 
lemma 6.4. 

Theorem 6.2. Let eq (23) hold. 
r'—Ly,, has ratio 6. 

From (16) 


Then the sequence 


r—Ly,=(a,\— Dy, +azy.+ ...+aty,. (26) 


Thus 


> 2 
— : 


r— Ly, ?=(a,—D)?+a,+ ... +a’. 


Hence the left side is a sum of terms having ratios 
63, 63, &,. . ., &. Since each of these numbers, other 
than the second, is strictly smaller than the second, 
the theorem is established. 

Concerning the convergence of |z" to L, we remark 
that L—\r'| is a sequence of ratio 6. The proof is 
like that of lemma 6.5, with a) replaced by .z‘|?. 

Information on the convergence of each com- 
ponent of 2‘ can be derived from eq (26). We write 
r'=(b}, b3,--+, b8), yr=(e, C2, +++, en). Now for 
fixed j, 6s —Le, is a linear combination of a\— L, a}, 

-, a, these sequences having respective ratios 6, 
bo, b,-++, 6. If the j” component of yp, the 
coefficient of aj, is not 0, then 6:—L c; has ratio 6; 
accordingly the difference sequence 6} —6}*', which 
is numerically available, has ratio 6. If the j™ 


component of y2 is 0, then, in general, 6;— Le, will | 


have a smaller ratio (assuming the sequence is not 
identically 0). 

For results on the convergence of sequences asso- 
ciated with the second characteristic vector y., we 
appeal to the forthcoming theorems of section 11. 
These results may be specialized to the sequence 

t';} by recognizing that in the notation of that 
section, 
a1,  € 22) My we 
Also, we agree to interpet 6,,, as 0. 

Concerning the rate of convergence of £ to Owe 
have, from lemma 5.3, that (& | is a sequence with 
ratio 4». 


Let eq (23) hold and set n'=€' 


-d., then ‘Xr, 


Theorem 6.3. 
(1) 1f & u(£'). isa positive, monotonic 
sequence with ratio &, and | »'—y. is a sequence 
with ratio 6). 
(2) If &< 6, then /u(é')—d,) isa positive, monotonic 
sequence with ratio (6,/6,)*, and | »'—y, 
quence with ratio 65/4,. 


(3) If & 


Is a SeC- 


6;, then w(£") — A, = O18) and | n' — y, = O15). 

These statements are interpretations of theorems 
11.1, 11.2, and their corollaries for j;=2. From 
theorem 11.4 we derive the inequality 


Ao— wlE') S(r')— Ay, 

the equality holding just in case r=2. From this 
and the above theorem we obtain the following in- 
teresting corollary. 
If & 


Corollary. >d,, then for sufficiently large j, 


Ao— w(E), S[Ai— wz"). 

More generally, when A,— y(£') is nonnegative, 
and 7 is large, then yu(£') is closer to A, than p(s‘) is 
to \,;. On the other hand, for case (2) of theorem 
6.3, the sequence {yu(t')—A,} has ratio (4,/6,)?> 68. 
But & is the ratio of {yu'—r, Hence in this 
instance uw‘ approaches A, more rapidly than y(é') 
approaches oy. 

Notice that for r=2, 
automatic. If r>2, then, from theorem 
inequality & > 4, holds whenever 


the inequality 6 >46, is 
11.3, the 


VII. The Method of Optimum a 


In sections 10 and 11 the method of fixed @ will 
be extended to obtain sequences that converge to all 
of the characteristic vectors ¥,, y2, - -- .¥-. Before 
proceeding to this generalization we wish to develop 
first the method of optimum a, so that a comparison 
of the two methods (and others) may be made at 
the most advantageous point. The optimum pro- 





cedure does not appear to generalize to a larger 
number of characteristic vectors as simply as the 
method of fixed a. 

According to eq (10) the iteration scheme is now 
given by eq (12) with 


9 


v >0. 


9 


sitve"+tat” 


The numbers ¢‘ are bounded (see remarks preceding 
theorem 5.1); so are the numbers s‘. Thus 
f(y')=7' is bounded from below by a positive con- 
stant. Accordingly, theorem 4.1 yields the follow- 
ing lemma. 

Lemma 7.1. Let y‘ be given by eq (31). Then the 
conclusion of theorem 4.1 holds. 


establish the boundedness of the y‘. This is done in 
the next lemma. 
Lemma 7.2. Let y'‘ be given by eq (31). Then 


— 
lim + *i—i 


Since uw‘), it is sufficient, by eq (11), to show that 


lim u(f")> dg. 


bow 


Let 7 denote the inferior limit on the left. Let 
om 
\€*| 


Then T=lim y(n‘). Select a subsequence {n*} such 
that 7 =Tim y(n). Select a further subsequence 
{n'} that” converges to, say, 7. From (n‘,2)=0 
and lemma 7.1 we obtain 


n 


: z' 
O0=lim ( ', =) =(, ¥:). 


le 


Since 7 belongs to «4 (space spanned by y,, - - -, y,) 


it follows that u(n)>A.. Hence 


T=lim u(n')=u(n)> Aa, 


as desired. —_ 
Theorem 4.2 now yields the principal result. 
Theorem 7.1. Let y' be given by eq (27). Then 


lim z'=Ly, 


tow 


lim p' —>h. 
t-+@ 


That the vectors £/\é| do not converge in the 
method of optimum a is a consequence of the next 
theorem. 


Theorem 7.2. Under the hypotheses of theorem 
qb, 
(e, &*)=0. 


We have, 


| by 





Hence 


i. )) =: (e, Ar')—y'( AF, &)+- yp! + ly *(€*, £°) 
={1l—y‘(u(é)—w'*")} (€, &4) 
=0 


by eq (11). 

The last theorem indicates that the coefficients a! of 
eq (16) converge in an irregular fashion as i —- o, 
Hence a determination of the rate of convergence 
such as that given in section 5 for the method of fixed 


— | aisnot to be expected here. 
In order to apply theorem 4.2 it is necessary to | I 


We conclude the section with a modification of 
theorem 7.1 for convergence to the highest char- 
acteristic vector ¥,. 

Theorem 7.3. Let the sequence {2'} be defined 
zitla=zity,eF, 
with 

2 


— +as+ ue 


L>0. 


¥' == 
Then 


lim z'>Ly,, 


i-+@ 


This result is obtained by applying theorem 7.1 
to the matrix B= —A. 


VIII. Comparison of Methods 


Each step in the preceding methods of gradients 
consists of forming the subspace spanned by r and 
§, i. e., by z and Az, and choosing in that two-space 
a next approximation 7. In the method of optimum 
a, the vector 7 is chosen so that u(Z7) is a minimum 
in the subspace; in the method of fixed a the parti- 
cular linear combination s—aé is chosen. Superfici- 
ally, then, it would seem that the first method 
should give more rapid convergence to dr, and y, 
than the second. But the minimizing procedure can 
only be recognized as an advantage when the vector 
z‘ is common to the two methods. This certainly 
holds for the initial vector 2°, but not necessarily 
beyond this stage; hence the relative merits of the 
two methods are not evident. It seems reasonable 
that for low order matrices the procedure of finding 
an optimum a@ might be advantageous, while for 
matrices of higher order the fixed a procedure might 
be superior. In practice a combination of the two 
methods would be in order. 

The method of fixed a has the advantage of com- 
putational simplicity over the alternative method. 
On the other hand the former requires some advance 


© We shall remark later on the possibility of dealing with subspaces of dimension 
higher than two. Cf. a forthcoming paper by W. Karush, An iterative method 
for finding characteristic vectors of a symmetric matrix. There the method of 
optimum a is generalized to subspace of arbitrary (fixed) dimension. 





ation on the characteristic roots in order to 
ite an allowable a; computation by the method 
‘imum a@ demands no such knowledge. 
- method of fixed @ is a smooth method. The 
rs £'/& converge; by a relatively simple ex- 
nm of the procedure we can obtain convergence 
the characteristic vectors of .4 (see section 10). 
ermore the convergence is geometric in nature. 
nethod of optimum @ has none of these advan- 
ta The successive gradients are orthogonal, and 
the coefficients @, a}, of eq (16) do not tend to 
Os oothly. 

Sequences {6} with ratio 6<1 provide examples of 
“linear” convergence. If the ratio 6'*'/(b')? has a 
finite limit, then the sequence converges in a “quad- 
retic’ fashion. In the method of fixed a we have 
convergence of a linear type. It is possible to 
procure quadratic convergence by modifying the 
method, but the price paid would be the solution of a 
system of linear equations at each step of the 
iteration. 

\ combination of the two gradient methods can 
be used to advantage when the characteristic roots 
\,, A, are close and relatively isolated from the other 
roots. Suppose we begin with the fixed @ iteration 
and continue up to a certain stage. The last esti- 
mate sand its gradient — may not be good individual 
approximations to y; and y. but the pair may provide 
an excellent approximation to the plane of y, and y. 
One application of the optimum @ procedure at this 
stage should then vield a good approximation F to y. 
The iteration may be started again with 7. This 
technique has worked well in several numerical 
examples. 

Since t='£/\2|->0, it is clear that fewer and fewer 
significant figures will be retained in é as the iteration 
proceeds in a numerical calculation. Nevertheless, 
é and w(é) supply useful approximations to y and 
\». For example, after y, and \, have been obtained, 
the approximation £ can be used as the initial vector 
in the calculation of y, and d, by a new use of the 
method of gradients. 

The above remarks apply equally well to the 
highest vector y,. One of the advantages of the 
gradient methods is that they can be applied directly 
to either end of the scale of characteristic roots. 

A single step (2) of the gradient procedure 
carries one from 


L=QyYi t+ q+ -++4,y, 


1+al(u—A))}ayyi+ fl + alu —dz)} doy2+- 2%, 


Thus if any characteristic number \, is known, then 
orthogonality of F to y,; can be achieved by choosing’ 


1 


a= . 
Aju 


= 


’ The same result ean be obtained by taking 7= Ar—)jz; the advantage of the 
above procedure is that it fits into the iteration scheme (12 


This observation may be used to maintain (approxi- 
mate) orthogonality which may be theoretically 
assured but may be gradually lost in computation 
due to round-off errors. For example, in beginning 
with £ as an initial vector for obtaining y, (see 
| second paragraph above), one must maintain ortho- 
| gonalitvy to y, during the iteration process. To do 
this, the procedure (12) may be interspersed with 
steps in which y'=(A,;—u')7~'. Similarly, one may 
maintain orthogonality to any number of charac- 
teristic vectors, or induce orthogonality to such 
vectors, provided only that one knows the cor- 
responding characteristic roots (relatively accur- 
ately). The desired ends are achieved within the 
framework of the scheme (12) by particular choices 
of y'. Notice that if r—1 of the r characteristic 
roots A, are known then the characteristic vector 
belonging to the remaining root can be obtained in 
r—1 steps. 

It was remarked after theorem 6.1 that Ay' 
u'*'—yu' and ¢ are sequences with ratio &. Hence 
observation of the ratios Au'*'/Au' or t'*'/t' can lead 
to an estimate of &. Since 6,.=1—a(A,—),), and @ 
is known, another estimate of A, may be made, 
alternative to the estimate u(é). Furthermore, if a 
sequence ¢' has a limit (, and the difference e'—C 
is a sequence with a ratio 6< 1, then a transforma- 
tion for speeding the convergence of ¢' is 
Ac* ( 5! 

1—6 


Ac’ 
Ac 


7! 


c+ 


-) 


cit! = bic! 
i—¢d 


Gt! — ae? 
1—é6 


=~ 
= 


With an estimate of 6,, this formula may be used 
to improve uv‘ and the components of rs‘ (see remarks 
following theorem 6.2). 

A certain degree of flexibility may be introduced 
in the method of fixed a by replacing the constant 
a by a variable a’, 

B' 
a’ , (28 
M ate 
which satisfies 


0< 8B, <6'<8.<2, (B;, B. constants). (29) 
The conclusion of theorem 5.1 remains valid, since 
this choice of a' does not violate the assumptions of 
theorem 4.2. 

The convergence of the gradients as in theorem 5.2 
may also be justified provided one imposes the addi- 
tional restrictions 


B.<1 and 22<s me, 


% X (30) 
B, “eA, 


| 
Under these conditions statements on rate of con- 
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vergence, appropriately formulated, can be estab- 
lished. 

For comparison purposes we state here, without 
proof, some properties of the “power” method. 
The iteration formula is 


Az'. 


Suppose now that there is one characteristic root 
of maximum absolute value; denote it by A,. Then 


21 


: f 
lim sy. 
' «a 7 


lim y(r')-—r, 
rae 


(In the first limit the sequence with odd indices 
tends to —y, if y<0, and to y, if A >0.) It was 
pointed out in the introduction that the power 
method is a gradient method with 


and the convergence result is 


lim 2°'=K y%, K 


const. 


(provided yu‘ is never 0). Thus yu‘, as well as 2‘, 
may be used as a normalization factor. 

The power method leads directly to an estimate of 
A,, and hence to an upper bound 2, of VW. Thus 
a rough application of this method may be used as 
a preliminary step in the method of fixed @ in order 
to determine an allowable value of the constant a. 

Suppose now that A, is a characteristic root of 
next highest absolute value. Concerning rate of 
convergence we remark that u'— dA, is a sequence of 
ratio (X;/A,)?. If—xX, is not a characteristic root, 
then £'= &(2*") tends in direction to 


Oisc—h) 
ees WEL 
the sequence with odd indices tends to the negative 
of this vector if \,<(0 otherwise to the same vector. 

We conclude this section with a remark on the 
method of relaxations... Any method which begins 
with a vector z and applies a sequence of transforma- 


tions to it so that the resulting sequence of vectors | 


x‘ converges to a characteristic vector, must induce 
the gradients £(x') to tend to the null vector. Thus 
any artful modification of a vector z to a new vector 
Z which brings the gradient closer to the null vector 


* Cooper, J. L. B., The Solution of Natural Frequency Equations by Relaxa- 
tion Methods, Qtrly. Appl. Math., vol. 6, p. 179 (1948). 


would be a plausible procedure in a numerical ce - 
culation. The full skill and intuition of the comput 
may come into play in varying the vector at any 
stage to produce a_ better approximation. QO; 
systematic procedure is to modify a single component 
of z in such a way as to make a single component ; 
the gradient vanish. Other devices may be usec: 
the best trick at any point depends upon the infor- 
mation available at that stage. This is the flexible 
approach of the method of yelentions. Clearly it is 
not easily adaptable to automatic computation. 


IX. Invariant Subspace «4 


Before turning to the generalization of the method 
of fixed a we shall develop here some properties of 
the invariant subspace <4. We will encounter cer- 
tain polynomials p,(A) which have been considered 
by Lanczos,’ and some of the results of this section 
will overlap his work. 

In place of the notation z° we shall use 2, to de- 
note a given vector with expansion 


Zi =A,Y) + AY + « «% aU, (31) 


asin eq (4). The space .4=(y, - - -, y-) may be char- 
acterized as in the following lemma. 

Lemma 9.1. The vectors 2,, A2z;, - 
span the space <A. 

Let .4 be the space spanned by the vectors A*z,, 
&==0, 1, 2, Then this space is the smallest in- 
variant subspace which contains the vector z,. Since 
z, isin .A, and .</ is invariant, it follows that the 
relation B<.+4 holds. 

Suppose that .# were a proper subset of the invar- 
iant space «4. Then .4 would be spanned by a 
proper subset of the characteristic vectors y), ys, 

-, y, of .%. Consequently the vector z,, being 
in 4, would be orthogonal to at least one of these 
characteristic vectors, say y,. Thus, from eq (31), 
a,=(2;,y)=0. This contradicts the assumption 
a, >0 (see eq 4). 

We now have .4=.:4. Consequently, dim .4=dim 
A=r. The lemma now follows from the fact that 
dim 4 is the least integer A such that the vectors 2,, 
Az,,..., A*-'z, span #. 

We now define the vectors 2, recursively as fol- 
lows. Let 


Then 


Az,;— 4121 
Az.— 222-82, 


> (33) 





~ am AS ~ _ Doon, ft ‘ 
241 AZy— py2Zy—lZ;-), j=2, 3, oe » %J 


***An Iteration Method for the Solution of the Eigenvalue Problem of Linear 
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th 
he 
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In 


tOTS 2}, 22, - - « » 27, 2-4; are well-defined and 


2,0, > ae 


pose, 2), , 2, are defined and none is null, 
Then by eq (33) 2e4,) is defined and is a linear 
ation of A®z,, A*-'z,,..., {z,, 2; with the 
nt of A*z, equal toone. Further, if k<r—1, 
., #0 by lemma 9.1. This establishes the 
m. 

duction it is easy to prove that 


l. 
(34) 


ma 9.2. Let 4 be the space spanned by 2), 
, A . Zt, i. &., 


Then ., 2) isan orthogonal basis for «4. 


Also, 


By lemma (9.1), has dimension j. From the 
above, the nonnull vectors (2, , 2,) are orthogonal, 
and lie in .A; hence they span .-/. 

The final conelusion of the lemma follows from 
the fact that z,,, is orthogonal to 2,,...,2, and 
hence to .%. But from the invariance of .<4, 2,.; 
belongs to .4. Thus z,,,—0. 

Given an arbitrary vector 2, the integer r of eq 
31)can be determined from the above lemma; r is 
that integer such that z,,, is the first 2, that van- 
ishes. Notice that except for j=r, <4 is not the 
space spanned by y;.. . ., yy. 

We consider now the expansions of the z, in terms 
(31) and (33), 


of the y; From eq 


Ay My) QyYit(Ag— wi) 2424 see ets ~ Mi) UY, 


Thus, if we put p,(A)=A— yn, we have 
Pi(y) QyYit Pr (Ae) Gaye - +. py (A,) @,Y,. 
Again from eq (33), 
23= fp, (A) (A, — a2) —B} ayy, 4 
+ {p, (A) (Ap — 2) — 88} yp. 


Hence, if we put po(A)=p, (A) (A—2)—t3 we have 


Po (Ay) Gy Yi + P2 (Az) Gayot + po(A,) @,Y>. 


In general, if we define the polynomials p,(A) by 


Pol A)= 1, Pi (A) =A— wy, Py (A)= Py-1 (A) A— By) 


—t py_2(d),J=2,°°°, 7 


(35) | 
55 


we have the expansions 


2541 Py (Ai) yy Py (Ag) Cayo b Py (Ar) Ber, 


g=0,1,-+-,F. (36) 


From eq (36) with j=r+1, and lemma 9.2 


that the roots of 


we see 


Pr (A) =0 


are the characteristic numbers \,, Aj, +++, Ay. This 
is a special case of the forthcoming theorem 9.2. 

In lemma 9.2 we defined the subspaces J, of J, 
In order to develop some properties of these sub- 
spaces we state without proof several well known 
properties of symmetric linear operators. 

Let .4 denote a linear space of real vectors. Let 
B denote a symmetric linear operator on 4. Con- 
sider an /-dimensional subspace .4’ of .4; let r be 
the projection operator into .4’. The operator +B 
with domain restricted to .4’ is a symmetric opera- 
tor on .4’; by the characteristic roots and vectors of 
B relative to 4’ we mean the corresponding quan- 
tities of this operator. An alternative characteriza- 
tion is given by the property that ¢ is a character- 
istic vector (relative to 4’) belonging to the char- 
acteristic root « if and only if ¢ belongs to 4’, vr #0, 
and 

(x, Br)=x(z, v), rin 2’. 
Also, the smallest and greatest roots relative to 2’ 
are respectively the minimum and maximum of u(r) 
on 4’. If.’ is an invariant subspace of .4 then 
tB=B on .4' and the characteristic roots and 
vectors relative to 4’ are such in the whole space 
4: this is the case with ./ as a subspace of ./. 

If w,,U.,-*+,u; is an orthonormal basis for .7’ 
then the operator +B relative to this basis has the 
// matrix representation 


B’ =(Au,, uy), pk 


The characteristic roots relative to .4’ are the roots 


of 
AJ —B’ —0. 


We quote a well-known result. (The first conclusion 
of the theorem below holds without the restriction 
that the «x; be distinct.) 

Theorem 9.1. Let Bbe a symmetric linear opera- 
tor on an m-dimensional real linear space .4; let 2’ 
be an /-dimensional subspace of .4. reeds that B 
has distinct characteristic values «K,< K< +++ < km 
with corresponding characteristic vectors 7), 02,° - 
tm. Let the characteristic roots and vectors of B 
relative to 4’ be respectively «i <xj<---<«; and 
v’,0y,-°°, vj. Then 


KS Ki, KeSk KK. 


The equality holds for all indices if and only if 





in this case .4’ is an invariant subspace. 

We return to the subspaces 4. The iiidaasits 
istic roots and vectors of A relative to /, are those 
of the symmetric operator B,=x,A on “4, where x, 
is the projection on «4. Since 2,,.... z, is an 
orthogonal basis of -/ the operator B, has the matrix 
representation 


(SEsut=) 


-i -m 


lim 


We calculate this matrix. 
Lemma 9.3. For a given j, 


when & 

when k 

when k=j+1 
otherwise. 


We remark that for j= 1, the first equation is to be 
omitted. The first and third equations follow from 
the first equation of (34). he second 
from the definition of u,. 
of the orthogonality relations of (34) and the fact 
that Az, is a linear combination of 2,,,, 2; and 2,_, by 
(33). 

It follows from this lemma that B, has the matrix 
representation 


f uw, t, 0 
ty pe ts 
O ts ps 


O 


me ee 
er 


O t; my P 








. 


Lemma 9.4.% Let go) =1 and q,(d) be the char- 


acteristic polynomial of B,, 


qi()=|AJ—B,;|, j=1,2,--- 


Then 


Pi (=, ), g=0,1,---, 8. 


Using the matrix representation above it is easy 
to see by direct calculation that 


qi(A)=~A Gs (XN =(A— wy) Qj-10) — 8 qy-2 (A). 


Bi, 


This result follows from the minimax principle for chs aracteristic values, 
Cf. Courant and Hilbert “Methoden der Mathematischen Physik,” 1, 2d ed , 
Berlin (1931), pp. 27-29. 





follows | 
The last is a consequence | 
1 





The lemma now follows from the fact that these ¢- TI 
cursion relations are identical with eq (35). 
Theorem 9.2. The roots »,<m< ... <v, of 


pAN)=0, 7 


7 
are the characteristic roots of A relative to the sub. mate 
space “4. Furthermore, 

lin 
A, Sy, oS Pr, eeey Ay Sy;, 

| the equality holding for all indices if and only if 
re By t 
This theorem follows from theorem 9.1 with we si 
4= 4, B=A. The last statement follows from the 
the fact that 4, j<r, is not invariant. hold 
theo 
X. Extension of Method of Fixed « | 
em 

We return now to the sequence {2') of eq (12) with 

y'=a=const. As in theorem 5.2 we impose the 

| condition (23)" on a We rename ras 2 and apply Also 
the results of the previous section to each 2. The 

_ expansion (31) becomes (16), 

Say +e@sye4 +A Yr. Con 


By lemma 5.1 each a‘ is positive; accordingly, the P: 
space ./ associated with 2, is independent of i and 
coincides with our original space 4. We recognize 
>, as our originial £'. 

The polynomials of eq (35) now depend upon the The 
index 7; we denote them by pi(A). For each 7 they 
are defined in terms of 


uy =p(2}), Fron 
36), 
It is convenient to introduce 
Pol) = 1 ’ 
BAA) =(A—AJ(A—Ay_) - -A—AD), q 
vie 
j= 1, 2,-+ +, #. 
The vectors (2), 2},- - -, 2!) comprise a sequence of 
| orthogonal bases for the space «4. We show that, 
when normalized, this sequence converges to the 
fixed basis y;, - yr these 
Theorem 10.1. st the constant a@ satisfy (23) 
_and let the initial vector 2?}=<2° be given by (4). 
| Determine the infinite sequences {2}, {23},- +--+, | 
| by (33) and 
2,**=2}— az}. P 
‘In this section and the next we assume without further comment that this 3 \ le 


condition holds. 


lim w(zi)=A,;, 7 . vy 
exe 


oof will be made by induction. For an 


2<k<r, consider the statement 


lim Pi-10), j=1,2,-->+, k. 


ie GG; 
(37) 
em 5.1, (5.2), lemma 5.3 and theorem 5.2 
wesee that the statement is true for k= 2. 
the statement true for k<r, we shall show that it 
holds for k+1. This will provide a proof of the 
theorem ee 
From the second eq (37), the definition of f, and 
lemma 5.2 it follows that 


By th 


t-0, j=2,--- 
\lso by the first eq (37) 


pi a;j, J Se eee 


Consequently, from eq (35), p\(A)-A—A, =P, (A), 


(A Ap) Pr (A) =P. (A), and generally, 


Fe 


P(X) pj), y) 
The vector under examination is 


PAIDQYL+ PAVDAY2 +: > > +p A/)aly,. (38) 


From the orthogonality relation (2), 2,.,)=0 and eq 


36), 


% pr(d,)pi(dai =0 


> pi_. Qd)) pia) ae 


Dividing by a,,, and writing 


pr)ae 


Oy +1 
these equations take the form 


Po (Ay) a ? Po (Ag) beset Po (Ax) b, 


Ay) a + pis (Ag) ag ++ - +4 pi (Ay) a = Bj}. 


D.2, 


Assuming | 


As i->@ the matrix of coefficients (p')(A,)) tends to 
(pi\rd). The latter matrix has only zeros below the 
main diagonal; its determinant is the product of the 
diagonal elements, namely, 

Pol A Pr(a) © - . De Oy). 
From the definition of the polynomials p, and the 
fact that the A, are distinct, it follows that this quan- 
tity does not vanish. Consequently, for 7 sufficiently 
large, the above linear equations in a may be solved 
for these unknowns. Furthermore, as 7 tends to 
infinity, the solutions have finite limits, say L,. 
Hence 

@d-—-L, j=1,2,...,8. (39) 
It is simple to compute L,; allowing (> © , we obtain, 
from the last equation, 


Peds } Pr (Aged) 
L, — _ ' 
Dr Ay) 

For our purposes it is not essential to know more of 
the limiting values L, than the fact that they exist. 
As a matter of interest they are evaluated at the end 
of the section. 

Now divide both sides of eq (38) by aj... 
lemma 5.2 and eq (39), 


From 


~t 
~ 


> Px (News) Yea 


. 
Ag+ 


From this we immediately obtain the eq (37) for 
j=k+1. This completes the proof. 
For future use we record some of the results of 
the above proof in the following corollary. 
Corollary. For j=0,1,---,7, 


Pp. (A) PA), > pj (A,). 


i 
1; 


Also, for j=1,2,---,/—1,1 
Pi (A, a* 


a; 


_ has a finite limit; this limit has the value 


_ ip (Ay) Pi 2(A2) 
Pi—2(Ai-1) 
when 


We return to the evaluation of L, of eq (39). Let 


L, 


° (40 
Pid 1) _ 


i 


| 
bi—>p, (rx, | Then the 8, satisfy the limiting equations 


57 


1) Px (Ae a1)s 





8,4 Bst.. .+ By z—] 


Prd) B, 4 Pid) Bo T + Di (2) By — Di (Aeas) 


Pr (A) 8.4 Pr (Az) Be +P, (Ax) By —Px i(Ae41)- 
In the second equation, D(A) is a polynomial of 
degree 1; by adding \, times the first equation to 
the second we may reduce the second to 

A,B, T AoBo , ae ArBy Neai- 
In the third equation, P,(A) is a second-degree poly- 
nomial; a linear combination of the preceding two 
rows will reduce this equation to 

ATB, T A382 + AEB: = Ni+i- 
Continuing in this way, 
system, 


we arrive at the equivalent 


Bi+ Bit... + B.=—1 


AiBi+ A2B2+ ee 


i 'B, t Ag 'Bs T “T ri "By 


7 k=-1 
—NeFi- 


The solutions 8, are 
terms of the function 


immediately expressible in 


*.(b;, bo, ---, b= II (b, 


i<m 
I, m=1,2,---,k 


—b,). 


From the identity 


Vi(bi, ba, - ++, 
= bs! — 
it follows that 


Vig, soe 


B,—(—1)* = » Ae, Aner) 


Vi (Aa, Aa, + + + 5 Ae) 


Vi (A, As, ees » Ar, Ars ) 


=(—1)**' ¥ 
) Vi (Ai, Aa, + + =, Ae) 


a=— Vi (An, » Art, Neva) 
. Vi (Ay, Aa, — i’ Az) 


The L, may now be computed from eq (40). 





| 
| 
| 
| where, 
have finite limits. 


| Now suppose 6? > 


= above-m 
| (41) has the limit 


_ consider the sequence { u(z;)). 


XI. Rate of Convergence 


We shall establish some results on rate of conv: 
gence for the sequences of the previous secti 
The terminology is that of section 6. Reeall ; 
definition of 6;,,7=1,2,.. .,7, ineq (25). It ise 


| venient to set 


=@. 


We remark that for the lengths 2: we have 
once, from the corollary to theorem 10.1 and lemm: 
6.2, that {/2}|} is a sequence with ratio 4). 

Theorem 11.1. For a fixed integer j, j7=2,3,. 
(1) lf & >6,_)- es 4s P Xb 

A, —u(2)} is a positive, a sequence with 
ratio (6,/6,_,)*. (2) If &<6,_,-6,,,, then {y(z')—\, 
isa positive, monotonic seque nee with ratio (4,,,/4;" 

We first assume j<r. Then from eq (36) 


(25, Azi)—A,/\25/? 


u(zi)—v, 


3 {n- d,) [pi (A)aj}?>+ . * 


t+ (Ay 1— Ay) [p' (Ay_as_,]}? 


+(Ajys1—Ay) [pi (Ajy4 as 4/2 + 


2 ( 2 

a’ a’ , 

a | i 
a’ 


Tot lt 
zy 


by the corollary to theorem 10.1, the 4 


> 6; 1554 1- We write 


2 
i 


a 
(u(zi)—A,)) +3 =- 
a’ 


(41) 


The factors of b, (except for b:_,) may be recog- 
— as sequences of ratio less than 1, by lemma 

; hence these sequences tend to 0. Hence, by 
entioned corollary, the left side of eq 


P; - 2(Aj) 
As Fs 0-1) 


Using lemma 6.2 we deduce the first conclusion of 


| the theorem. 


Suppose next that 6?<4,_,4,;,;. Then we write 








he 
he 


lit 


wl 
m: 


then 
vith 
-K, 
6 2 


e by 


(41) 
cog- 
nma 


f eq 


n of 


te 








a’ as { 1° ‘ 

; ( ' 

u(zs)—Ay) 7s b\ a —e ce 
G+. 2; ( a4; +1 


a‘ 1 a‘. ) 
+Ha(| Jott fe4.. -)e 


1a’, 
Al terms tend to 0 except 6!,,. Hence the left 
si has the limit 


WF _ (X541) 
(Aga. — A) >. 
oe ) Pe (Ay) 


From this we deduce the second conclusion of the 
theorem. 

There remains the case j=r. In this instance 
only case (1) of the theorem is possible. The proof 
is identical with that given above for that case, 
except that no terms 6, and higher appear. 

Theorem 11.1 leaves open the possibility of the 
equality 6?=6,_,4,4,.. The likelihood of this condi- 
tion holding in a numerical instance is very slight, 
but the problem is subtler than the instances of in- 
equality and thereby has theoretical interest. 
establish the following corollary. 

Corollary. Suppose &=6,_,6;.,. Then 


p (23) —A,; =O (6?/8?_)). 
To prove the corollary we require the following 
lemma. 
Lemma 11.1. If &=46;_,6;,, then the sequence 
{ai_,@).,) 
( a‘ ) 


’ 


has a finite positive limit. 


We remark that although the sequence in question | 


has ratio 1 it does not follow from this that it has a 
limit. 
By eq (16) 


a), @54, a@)_,a?,, 
2 ane 2 
a’ a° 


i 3 


Sp ay 49} (1a Qye.—0) 
k=0 {1—a(A;—p")}? 

(42) 

where p*=yp (2%). 

may be written 


[6y-1 +a (u*—2,)] [6)41 + a(u*—A))] 
[6,+ a(u*—A,)]? 





Letting 


B*=p*—d,, K, =— ? K . = 


We | 


The general term of the product | 





| the general term becomes 


(1+ 1,61 + Kp") 
(1+A,6*)? 
where 
b*— pK, +k, 


2K) +higher terms in 3° 


The coefficient of 3* is positive. For, 


‘ . neh «= 2 
K,+K,— 2k, ( +. -2) 

6, \Gj-1  8y41 

a l 6 6 

2 a 1) 

5, (? p ) (» 0;_1 0 

= ts 1)- 0 

6 


‘ 


It follows that for sufficiently large &, b* is positive. 
Furthermore, by theorem 6.1 and lemma 6.2, b* is a 
sequence with ratio 6. Hence >3b* converges; 


kK 
thus the product II(1+6*), and hence eq (42), con- 
verges to a positive limit as i ©. 

We turn to the proof of the corollary. As in the 
proof of the theorem we form eq (41). We see that 
all terms tend to 0 except the terms in 6'_, and 5... 
The first of these has a finite limit, and, by lemma 
11.1, the second does, too. This completes the 
proof. 

Notice that for the corollary we cannot deduce 
that u(2})—A, is a sequence with a ratio; this is be- 
cause we do not know that the right side of eq (41) 
has a nonzero limit. In fact, it seems likely that 
this limit is zero. 

Theorem 11.2. Let j be a fixed integer, j7=2, 3, 

-,r. Set uf=——- 


“se 
(1) If & >6,_,4;.;, then |ui—y, has ratio 6, 5,_,. 
(2) If &<6,_,4;,,, then ui —y,| has ratio 4,,,/6;. 
From the corollary to theorem 10.1 we see that 


pi -10Ay) a5 


lo 
“se 


1 


tends to 0. We wish an estimate of the rate of 
convergence. 


By eq (36), 


25 (7/[pi —1 (As) a) ]?— 1 = 1/[p; 1 s)]? {Ep} -1 On) ah/ay]? + 


vst [pie Ay-1) a) -1/a5]? + [pj 1540) @) 41/05]? +- - -}. 


Denote by c’’ the first term on the left. Under case 
(1) of the theorem we multiply both sides by 


d‘ =(a‘_,/a‘)*. 


As in the proof of theorem 11.1 we find that 
(ce —1)d) has a nonzero limit. Noting that 
1—(1/e')=(e"—1)/e' (1 +e") and using c'->1, we find 


that 
l ' 
( = ‘ye 


has a nonzero limit. (43) 


On the other hand, under case (2) we find, by a 
similar argument that 


(1- ‘ya 41 


has a nonzero limit. 


We have 


. ] : 
- Y) - T7138 Lv. (A)as}? 4 


, 1 
+ [pi ir; da +(, 1) 


+ [pi Aja ay , \. 

Consider condition (1) of the theorem. Multiply 
both sides of the above equation by d;. From eq 
(43) and the corollary to theorem 10.1 we obtain that 


‘us —y,; 7d’ (45) 


has a nonzero limit. 
This proves the first part of the theorem. Consider 
next condition (2). This time we multiply both 
sides of the above equation by d‘,,. Using eq (44) 
we find 

Uys da) 
has a nonzero limit. 


the theorem. 
Corollary. 


This completes the proof of 


Then 


Suppose & = 6,_ ,6,,). 
us — y;| = O(6,/6;_)). 


Using lemma 11.1 we show as in the proof of the 
above theorem that the left side of eq (43) has a finite 
limit. Proceeding further as in the theorem we 
show that the left side of eq (45) also has a finite 
limit. This completes the argument. 

The following result is of interest in connection 
with the preceding theorems. 

Theorem 11.3. Suppose r>3. 
teger with 2<j<r—1. 


Let j be an in- 


ee ey ny ee 
then 
57 > 6;— 1 6j41. 


' coefficient of \’~' is —(uj +... 


Let g;=(A;—2,) M.—-From the definition of the ; 


(1—Bq,)°—U —Bq;-1) 1—Bq;.)) 
Bg? —qy-1 dsl +BD, 


S— 821841 


where 


(9j4i— Gy) —(Gi—-Gi-v 


l 
WV [(Xj41—- Ay) —(Ay—Aj_y)]- 


We find that 


65-1 Oy 41 B* l(qs—4s qi Dd) + BD 


B*(q;—q;-1)° +(8 — B’qj_,) D. 


Since 0<_B<1, and 0<q,;_,<1, it follows that the 
coefficient of )) is positive. Hence the left side js 
positive if Dis. This concludes the proof. 

Notice that if j=r, the conclusion of the theorem 
holds, for, 6,,,—0. Also, if r=2, then j=2 is the 
only meaningful value of j and the conclusion like- 
wise holds. 

Theorem 9.2 enables us to prove an interesting 
result for the errors \;—pz'. 

Theorem 11.4. Let 


e=Aj—wj}, J=H1,2,..., 
Then for each 7, 


ea 
é, +e 


; 
(+> 


Consider a fixed 7, j=1,2,...,7, and a fixed i. 
As in theorem 9.2 let », v, denote the roots of 
p..\)=0. From the definition of the jth degree 
polynomial pi(A) we can verify directly that the 
+u'). Hence 


j 


>a eae 
f=1 t= 


From theorem 9.2, 


l = 
Se = Sai =D Dn= DOS, 


k=1 k 
the equality holding just in case j=r. 
XII. Concluding Remarks 


Associated with a vector 2, is the chain of sub- 
spaces 


“Ach cee “a 


of section 9, together with the corresponding poly- 
nomials p,(\), whose zeros give the characteristic 
roots of A relative to 4. Let »,; denote a mini- 
mum solution of p,(A)=0. It is a consequence of 
theorem 9.1 that 
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12m, 2D +++ Dr = 


if » , denotes the next smallest solution, 


_ arly ’ 


V2, 222,32 +++ DV =z. 

\) logous results hold for the higher characteristic 

Thus, the characteristic roots of A can be 

fo ad by successively finding the zeros of per), 
ete.” The procedure may be followed, for 

ex imple, after a good approximation 2; has been 

obiained by, say, one of the gradient schemes. 

‘he subspaces and their polynomials may be used 
in other Ways in conjunction with the method of 
eradients. It was pointed out earlier that in the 

dient methods we pass in each step from = 2, 
to a next approximation 2, which lies in the subspace 
associated with 2. One might consider choosing 
in “A or even a higher-dimensional subspace. 
The extra labor involved might be justified, or even 
essential, in an ill-eonditioned problem in’ which 
several roots are clustered about \, (or \,). 

To illustrate, suppose that A,, Ay, As are close to- 
vether and reasonably isolated from the other roots. 
\fter a certain number of iterations it would be ex- 
pected that the subspace 4 spanned by 2, 22, 2s 
would contain only the characteristic vectors y,, Yo, 
y, to a good approximation. At this stage, although 
the 2’s might not be good individual approximations 
to the y’s, the characteristic vectors u,, U2, Us relative 
to 4 would provide such approximations. Further 
accuracy for y, would then be obtained by continuing 
the iteration process with 1%. 


We proceed to derive the formulas for the char- 


acteristic roots v,;<v.<v; and characteristic vectors 
;, Us, Us relative to «A. The roots vy are the solu- 
tions of 

Ms) —t(A— py) =0. 


pad) = (A— wy) (A — a) (A— ws) —B(A 


This cubie is most easily solved by introducing the 
new variable p and the constants yu, o, as follows: 
B=—(mitpetus), or=we—e, K=—1,2,3, 
A=u+tp. 

The equation to be solved is now 


pe +bp+e=0, 


where 


b=o,02+- 0.03;+-6,0,——ti, c=tio;+t§ o,—0) 02463. 


The solutions may now be determined rapidly by, say, 


To find u,, the characteristic 
/ belonging to »,, we write 


2+ BZ; 


his is one of the methods developed by Lanczos. 


Newton’s method. 
vector relative to 


U;=—2,; 71 az 
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and attempt to find @ and 8. To do this observe 
first that since u, is a characteristic vector relative 
to ~ belonging to », 


(u, Auy—»,u,)=0, uin.A, k—1,2,3 (46) 
Using eq (46) with & 5 and u ci, @ 3 in turn we 
find, by eq (34), 

0 (pay V;) @ Ze : 
0 a oF Bus By, 
Thus 
vy My 
a 
ty 
a 
pB 
Vi) — Ma 
so that 
v ] 
uy=21+ ( Zo+ Ze ): 
Vy Ms 


The formulas for wu, and u, are obtained by replacing 
vy, by » and vy, respectively. 

Suppose the lower characteristic roots Ay, A... . 
and the higher characteristic roots \,, A, ;, have 
been accurately calculated. The intermediate roots 
and vectors can then be caleulated by the gradient 
procedure (12) as follows. We apply the procedure 
to an initial vector z°, inducing (if necessary) and 
maintaining ee to the vectors 4), Yo, 
and ¥,, Yr... . . by more or less frequent selection 
of y' as 1/(A,;—x4'), with A, ranging over the known 
roots (see section 8). Use of the subspaces dr, tr, 

may come into play as described above. Notice 
that as more roots are found the constant @ in the 
method of fixed a may be chosen larger. 

We remark that independent characteristic vectors 
belonging to a multiple characteristic root can be 
determined by varying the initial vector 2°. 

We conclude by pointing out that the results of 
theorem 10.1 remain valid if the constant a@ satis- 
fying eq (23) is replaced by the variable value of 
eq (28) where, in addition to eq (29) and (30), the 
condition 

Bs min 1—™ 

B, j=3,4, Ny 1 A; 
is imposed. The purpose of this restriction is to 
guarantee that aj/a\->0 for j7<k. The results on 
rate of convergence in section 11 may be modified 
to fit the new conditions. 


Los ANGeLes, May 4, 1950. 





